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Effectiveness of Error Control in Data 
Communication over the Switched 


Telephone Network 


By R. L. TOWNSEND and R. N. WATTS 
(Manuscript received September 24, 1963) 


This article describes the resulis of a data communication experiment 
designed to investigate the effectiveness of error detection and retransmission 
in providing high-accuracy data transmission over the switched telephone 
network. Data were encoded into a Bose-Chaudhuri (31,21) error-detecting 
code and transmitted at 2000 bits per second by a DATA-PHONE data 
set 201A over a variety of dialed long-distance connections. Transmitted 
and received data were compared to obtain error data which were analyzed 
to obtain an estimate of the error performance of the data set and the effec- 
tiveness of the code. The results of this analysis are presented. 

During the test approximately 6.36 X 10° 31-bit code words or 1.97 X 
10° bits were transmitted. Of these, 63,002 bits appearing in 29,731 different 
code words were received incorrectly. Thus, the over-all bit error rate was 
3.19 X 10-° and the word error rate 4.67 * 107*. The decoder was suc- 
cessful in detecting all but two of the erroneous code words, resulting in an 
average undetected word error rate of 3.14 X 10-8 or an average of 9.85 X 
108 bits between undetected word errors. These results demonstrated that very 
low undetected error rates can be obtained in practice using an error detection 
and retransmission system of modest complexity. 


I. INTRODUCTION 


Much attention has been focused recently on the problem of trans- 
mitting digital data over the switched telephone network with a high 
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degree of accuracy. Selection and evaluation of error control schemes 
by which the desired high accuracy can be achieved require detailed 
information about the digital error statistics. Because of the complexity 
of the switched telephone network, the only feasible way to obtain this 
information is through the analysis of experimental data. 

A method of error control which offers promise for use with telephone 
facilities is error detection and retransmission. An experiment has been 
performed to explore the feasibility of this type of error control and to 
obtain useful statistical information about the switched telephone net- 
work. In this experiment, a DATA-PHONE data set 201A,! which is a 
4-phase unit designed for synchronous operation at 2000 bits per second, 
was used to transmit data over a variety of connections in the direct 
distance dialing network. The transmitted data were encoded into the 
Bose-Chaudhuri?? (31,21) code described in Appendix A, which had been 
selected on the basis of a computer study. Transmitted and received 
data were compared to obtain error data from which digital error sta- 
tistics were derived. 

The over-all results of the test are shown in Table I. This indicates 
that the decoder was successful in detecting all but two of the 29,731 
words received containing transmission errors. These results demon- 
strate the feasibility of providing high accuracy data transmission over 
the direct distance dialing network by using an error detection and 
retransmission system of modest complexity. A description of the error 
control equipment is given in Appendix A. 

A description of the test and an analysis of the numerical error data 
are presented in the remainder of the article. 


II. DESCRIPTION OF THE TEST 


The test was conducted between March 13 and August 31, 1962, 
during which time approximately 1.97 X 10° bits were transmitted. A 
portable transmitter was used to transmit data from various locations 
throughout the Continental United States to a stationary receiver 
located at Murray Hill, New Jersey from March 13, 1962 until May 1, 
1962, and then at Holmdel, New Jersey for the remainder of the test. 
All performance measurements were made at the receiving terminal. 

At both Murray Hill and Holmdel three foreign exchange lines were 
installed, one each to a No. 5 crossbar, No. 1 crossbar, and step-by-step 
central office. The characteristics of these lines are outlined in Table IT. 
Dialed connections were originated from the receiving terminal, which 
was so arranged that it could be connected to any of the three lines. At 
both receiving stations calls were distributed equally, as nearly as pos- 
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TABLE [— ExprerIMENTAL RESULTS OF DATA CoMMUNICATION 
OVER THE SWITCHED TELEPHONE NETWORK 











Number of transmitter locations 28 
Number of calls 548 
Number of hours of transmission 273 
Total bits transmitted 1.97 X 109 
Information bits transmitted 1.33 XK 109 . 
Words transmitted 6.386 X 107 
Number of bits in error (total) 6.30 X 10! 
Number of words in error 2.97 * 10! 
Number of undetected word errors 2 

Bit error rate 3.19 X 1075 
Word error rate 4.67 X* 10-4 
Undetected word error rate 3.14 X 1078 
Factor of improvement (word) 1.49 X 104 
Average bits between undetected word errors 9.85 10° 





sible, among the three foreign exchange lines. The duration of each call 
was approximately 30 minutes. 

The transmitting terminal was moved to the locations listed in Table 
III. These were selected on the basis of their distance from the receiving 
terminal, types of connecting facilities and type of end switching office. 
Since one objective of the experiment was to collect and to analyze data 
transmitted over typical connections, the locations selected were in or 
near large metropolitan areas where data traffic is likely to be heaviest. 

A pseudo-random sequence generator was used to produce a repetitive 
pattern of 511 distinct 31-bit code words. These were transmitted serially 
at 1000 bauds or 2000 bits per second by a DATA-PHONE data set 
201A. Received data were demodulated with another data set 201A 
and then compared with the output of a synchronized, duplicate se- 
quence generator. The output of the receiver and system performance 
information were recorded on magnetic tape. Error data also were 
recorded by means of electronic event counters. A test log was kept which 


TABLE II— Recervine ENp Test LINES 











ipo Ty Pree Location of CO Line No, | Line aon 0 Type of Lie 
MH #5XB | New Providence, N. J. | 4641116 3.4 db | H-88 
MH X1XB | Plainfield, N. J. PL68684 9.8 db | H-88 
MH SXS Carteret, N. J. 5414054 13. db | H-88 
HO X5XB | Holmdel, N. J. 9464674 5.3 db | H-88 
HO ¥1XB | Rahway, N. J. 3814270 10.4 db | H-88 & 
N carrier 
HO SXS Monmouth Junction, IA96550 | 11 db | H-88 & 
N. J. N carrier 
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Trans- 
TranemitigrTocation | CO | Date | office Type | Nojot| mision | Total Bits 
(Hours) 
Rahway, N. J. FU8 3/15/62 | XIXBAR} 11 5.5 3.96 X 107 
Passaic, N. J. PR8 3/13/62 | 1 XBAR 9 4.62 | 3.32 X 107 
Paterson, N. J. MU3 3/22/62 | ¥IXBAR fi 3.22 | 2.32 X 107 
Ridgewood, N. J. 444 3/27/62 | X5XBAR| 10 4.88 | 3.52 X 107 
Mentetien (N.Y.C.),| 349 4/18/62 | ¥1XBAR | 10 4.83 | 3.48 X 107 
Meet tan (N.Y.C.),| HA5 4/23/62 | ¥IXBAR| 18 6.5 4.68 X 107 
Menberan (N.Y.C.),} LT1 4/20/62 | ¥5XBAR| 12 6.17] 4.44 x 107 
Manhattan (N.Y.C.),| RI9 5/11/62 | #1XBAR| 14 7 5.04 X 107 
Menbetian (N.Y.C.),| UN1 5/10/62 | ¥5XBAR 9 4.45 | 3.20 X 107 
Ni; 

Brooklyn, N. Y. JA2 5/16/62 | ¥IXBAR| 12 6 4.32 X 107 
Queens, N. Y. 445 5/18/62 | #5XBAR} 12 6 4.32 X 107 
Freeport, N. Y. FR9 5/22/62 | ¥5X BAR| 138 6.5 4.68 X 107 
Central Islip, N. Y. | C4 5/21/62 SXS 11 5.67 | 4.08 X 107 
Trenton, N. J. LY9 3/29/62 SXS 9 4.53 | 3.27 X 10? 
Camden, N. J. Wwo4 4/2/62 | #1XBAR 9 4.63 | 3.33 X 107 
Manahawkin, N. J. LY7 3/20/62 SXS 10 5.02 | 3.61 X 107 
Atlantic City, N. J. | 823 4/6/62 SX8 11 5.64 | 4.06 X 107 
Bridgeton, N. J. Ghil 4/4/62 | #5XBAR 7 3.68 | 2.65 X 107 
Hartford, Conn. 247 6/25/62 SXS 10 4.97 } 3.58 X 107 
Washington, D. C. 232 7/20/62 | #1XBAR| 14 6.75 | 4.86 X 107 
Washington, D. C. 333 7/18/62 | ¥5XBAR| 14 7.0 5.04 X 107 
Washington, D. C. 392 7/19/62 SxS 12 6.08 | 4.38 X 107 
Washington, D. C. 393 7/17/62 | ¥1XBAR| 12 6.0 4.32 * 107 
Newton, Mass. 244 6/27/62 | ¥1XBAR| 11 5.65 | 4.07 X 107 
Waltham, Mass. 899 6/26/62 | ¥5XBAR 8 3.95 | 2.84 X 107 
Quincy, Mass. 773 6/28/62 | #1XBAR 9 4.63 | 3.33 X 107 
South Boston, Mass. | 268 6/29/62 |} ¥5XBAR] 14 7 5.04 X 107 
Atlanta, Ga. 231 8/29/62 

to 8/30/62 | KS5XBAR| 32 | 16 11.52 X 10’ 
Atlanta, Ga. 237 8/28/62 SXS 11 5.23 | 3.76 X 107 
Atlanta, Ga. 457 8/31/62 SXS 13 6.42 | 4.62 X 107 
Atlanta, Ga. 521 8/27/62 | X5XBAR| 138 6.5 4.68 X 107 
Atlanta, Ga. 525 8/28/62 SXS 13 6.5 4.68 X 107 
Hammond, Ind. 844 8/24/62 | X5XBAR 9 4.5 3.24 & 107 
Libertyville, Ill. 686 8/21/62 | ¥5XBAR 8 4 2.88 X 107 
Lafayette, Ill. 247 8/23/62 | ¥1XBAR| 12 6 4.32 X 107 
Wabash, II. 431 8/22/62 | KSXBAR| 12 | 6 4.32 X 107 
Superior, III. 467 8/20/62 | #1 XBAR 7 3.25 | 2.35 X 107 
Los Angeles, Calif. 234 8/7/62 

to 8/8/62 SX8 22 | 10.83 | 7.44 X 107 
Los Angeles, Calif. 273 8/6/62 X¥5XBAR| 12 6 4.32 X 107 
Los Angeles, Calif. 385 8/8/62 SX8 10 4.91 | 3.54 X 107 
Los Angeles, Calif. 620 8/9/62 | ¥5XBAR | 22 | 11.57 | 8.33 X 107 
Los Angeles, Calif. 655 8/10/62 | K5X BAR 6 3.06 | 2.21 X 107 
San Francisco, Calif. | 399 8/16/62 SXS 13 6.58 | 4.74 * 107 
San Francisco, Calif. | 981 8/17/62 | X5XBAR 2 1 0.72 X 107 
San Francisco, Calif.| YU1 8/14/62 | X5XBAR] 11 5.38 | 3.87 X 107 
San Francisco, Calif. | 982 8/16/62 | #IXNBAR]} 12 5.67 | 4.08 X 107 
San Francisco, Calif. | 982 8/15/62 | #1XBAR 4 2.06 | 1.48 X 107 
San Francisco, Calif. | 982 8/18/62 | XIXBAR| 11 5.65 | 4.07 X 107 
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summarized the results of each call. Included were descriptions of any 
unusual transmission or operational conditions which caused the test 
to be interrupted. Appendix A contains a description of the test system. 
The complete test procedure is given in Appendix B. 

During the test 548 calls were completed, each containing approx1- 
mately 3.6 X 10° bits. The magnetic tape data were reduced and ana- 
lyzed for the 412 completed calls which contained errors. The other 136 
completed calls were error free. Fifty-nine attempted calls were not 
completed for reasons which are summarized in Appendix C. 


III. ERROR RATES 


During the course of the test approximately 1.97 10° bits were 
transmitted. Of these, 63,002 bits were received incorrectly, giving an 
over-all bit error rate of 3.19 X 10-5. As has been mentioned earlier, 
data were transmitted in 31-bit code words. Of the 6.36 x 107 code 
words transmitted, 29,731 were found to contain one or more bit errors. 
This gives an over-all word error rate of 4.67 X 10-*. The decoder was 
successful in detecting all but two of the erroneous code words, thus 
yielding an average undetected word error rate of 3.14 X 107%. This is 
equivalent to an average of 9.85 X 108 bits or 136 hours of transmission 
between undetected word errors. 

The over-all distribution of bit error rates per call is plotted in Fig. 1. 
Also plotted in this figure are the corresponding distributions observed 
by Alexander, Gryb, and Nast? for transmission rates of 600 bits per 
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Fig. 1 — Bit error rate distribution for all calls: Alexander, Gryb, and Nast, 
600 and 1200 bits/sec. 
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second and 1200 bits per second in a different test. The distributions 
for the Alexander, Gryb, and Nast 1200 bits per second data and the 
201A data (2000 bits per second) show a remarkable similarity in view 
of the fact that the two tests employed different types of modulators 
operating at different speeds. 

__ A question of major interest is ‘What factors have the greatest effect 
on error rate?” An attempt to answer this question was made by sorting 
the call error rates by all of the known parameters of the call, such as 
types of central offices at the transmitting and receiving ends, time of 
day, day of week, etc. Since none of the calls was actually traced, fac- 
tors such as types of carrier systems in the circuit, types of intermediate 
central offices, etc., for any given call were generally not known. The 
only call parameter examined which showed a clear relationship with 
error rate was distance over which the call was made. Although none 
of the other parameters showed a definitive effect on error rate, this 
does not necessarily imply that these other parameters do not affect 
performance. It is likely that the data recorded did not allow adequate 
separation of the effects of these parameters. 

Calls were classified into exchange, short-haul, and long-haul cate- 
gories. Following the definitions used by Alexander, Gryb, and Nast, 
exchange calls are those within a single dialing area; short-haul calls 
are interarea calls between points separated by an airline distance of 400 
miles or less; and long-haul calls are those exceeding 400 airline miles. 
Distributions of bit and word error rates per call for these categories 
are shown in Iigs. 2 and 3. Again the bit error rate distributions of these 
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Fig. 2— Bit error rate distribution for all calls: exchange, long-haul, and 
short-haul. 
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Fig. 3 — Word error rate distribution for all calls: exchange, long-haul, and 
short-haul. 


three categories are very similar to the corresponding distributions of 
Alexander, Gryb, and Nast. As a rule, word error rates varied quite 
uniformly with bit error rate, indicating that the parameters studied 
had little effect on the density of error bits in an error word. 


IV. CORRELATION BETWEEN ERRORS 


It is well known that digital data errors in telephone circuits tend to 
be bunched together,” but little is known about the exact nature of their 
correlation. One measure of the degree of correlation ketween errors 
is the autocorrelation function of the bit error sequences of the calls. 
Here we shall define the sequence {X;,;}, 7 = 1,2,--- ,N; of call 7 to 
be the binary sequence having 1’s in positions corresponding to the 
positions of bits incorrectly received, and 0’s in positions corresponding 
to error-free bits. The number N; of terms in the sequence is equal to 
the number of bits transmitted in the call. We shall define the nor- 
malized autocorrelation function g(k) of the bit error sequences of any 
collection M7 of calls to be: 

Nj—k 
Dy dy XHX ise 
g(k) == 


1 
j—k 
S > X ji 


7eM i=l 





As the number of terms in the above expression becomes large, o(k) 
will converge to the conditional probability that, given an error bit, the 
bit k positions later will also be in error. 
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The normalized bit error autocorrelation function for all calls con- 
taining errors is plotted in Fig. 4. This curve shows the presence of a 
very strong periodic component. The fact that the oscillations occur at 
a rate of exactly 120 cycles per second strongly suggests 60-cycle power 
line interference with the circuit. This periodic component was traced 
to three calls from a single location. The bit error autocorrelation func- 
tion for all calls except the three containing a 120-cycle component is 
shown in Fig. 5. The general shape of this curve is similar to that ob- 
served in other studies.! The three periodic calls are excluded from the 
remaining distributions. 

The autocorrelation function was also tabulated individually for each 
call. Efforts to find relationships between the autocorrelation and the 
known parameters of the calls were generally unsuccessful. It was 
noticed, however, that the initial shapes of the autocorrelation functions 
were similar for most calls, but the sizes of the tails varied widely. 
For most individual calls the autocorrelation function decreased con- 
siderably more rapidly than did the autocorrelation for all calls, which 
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Fig. 4 — Bit error autocorrelation for all calls. 
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Fig. 5 — Bit error autocorrelation for all calls except those with a 120-cycle 
component. 


is shown in Fig. 5. As one would expect, those calls whose autocor- 
relations had large tails were found to contain short periods of very 
high error rates. Surprisingly, there was not a very strong correlation 
between the call error rate and the size of the tail, but there was an 
apparent relationship between the variance of the error rate over one- 
minute intervals within the call and the size of the tail. This suggests 
that a long tail on the autocorrelation function probably was due to 
short dropouts or very noisy periods which were more or less inde- 
pendent of the over-all error rate. 

The autocorrelation of the word error sequences was similarly com- 
puted. Here the autocorrelation is defined analogously, with error bits 
being replaced by error words. The word autocorrelation for all error 
calls except the three previously mentioned 120-cycle calls is plotted in 
Fig. 6. As one would expect, this curve is very much flatter than the 
corresponding error bit autocorrelation. 

Further insight into the nature of the bunching of the errors can be 
obtained from the distribution of error-free bits between errors. The 
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Fig. 6 — Word error autocorrelation for all calls except those with a 120-cycle 
component. 


empirical cumulative probability distribution function of the number of 
error-free bits between bit errors is shown in Fig. 7. In this curve the 
ordinate gives the fraction of the total number of bit errors whose 
proximity to the previous bit error was equal to or less than the value 
given on the abscissa. As the number of occurrences becomes large, the 
empirical probability distribution function will converge to the true 
probability distribution function. It is interesting to note that the curve 
has a rather sharp knee at about ten bits on the abscissa and levels off 
to an ordinate value of approximately 0.65. This suggests that roughly 
one-third of the bit errors are separated by at least 200 bits (100 ms) 
from the previous error and that the remaining two-thirds of the errors 
are usually separated by not more than ten good bits. The errors are 
therefore observed to be bunched together in groups. The distribution of 
the lengths of these groups will be discussed in the next section. 

The corresponding empirical probability distribution function for 
error-free 31-bit words between word errors appears in Fig. 8. The fact 
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that the derivative of this curve changes comparatively slowly implies 
that small groups of errors are themselves bunched together, since the 
separation of errors would be much greater if they were randomly 
distributed. 


V. ERROR BURSTS AND DROPOUTS 


KKnowledge of the duration and error density of a burst of errors is 
important, since it is desirable to avoid combining bits into words in 
such a way that a substantial fraction of the bits in a single word is 
likely to be in error. Let us define an error burst of density 1/b to be 
any sequence of bits starting with an error bit and at least b bits long 
such that every block of b bits within the sequence will contain at least 
one error bit. In other words an error burst is a sequence which begins 
with an error bit and does not contain b or more consecutive correct 
bits. We shall define the length of the burst to be the length of the 
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Fig. 7 —- Empirical probability distribution function for error-free bits between 
errors. 
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Fig. 8 — Empirical probability distribution function for error-free words be- 
tween errors. 


longest. sequence consistent with the above definition. For example, 
consider the following sequence: 


000000000010110000010000000000. 


Let us assume that the 0’s represent bits which were correctly received 
and the 1’s represent bits which were incorrectly received. According 
to the above definition this sequence contains two bursts of density 
1/5. The first burst begins with the eleventh digit in the sequence and is 
eight bits long. The second burst begins with the twentieth digit of the 
sequence and is five bits long. The sequence could also be thought of as 
containing a single burst of density 1/10 beginning with the eleventh 
digit and 19 bits long. 

The empirical probability distribution functions of the lengths of 
bursts of densities 1/5, 1/10, and 1/31 were calculated and are plotted 
in Figs. 9-11. It can be seen that most high-density bursts are fairly 
short. We observe that the bursts become considerably longer for error 
densities of less than 1/10, which is in agreement with Fig. 7. 
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Fig. 9 — Distribution of lengths of bursts of density 1/5. 
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Fig. 10 — Distribution of lengths of bursts of density 1/10. 
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Fig. 11 — Distribution of lengths of bursts of density 1/31. 


A dropout is a phenomenon whereby the connection is temporarily 
interrupted and the line signal drastically attenuated or completely 
lost for a fraction of a second. In the system tested a dropout caused 
only 1’s to be received regardless of the transmitted message. Any 
sequence containing at least ten bit errors and in which only 1’s were 
received was deemed to be a dropout. On the basis of this definition, 
about two per cent of the high-density bursts were found to be dropouts. 
These were contained in 44 different calls. 

The empirical probability distribution function of the lengths of the 
observed dropouts is shown in Fig. 12. It should be pointed out that this 
distribution may be biased, since some of the longer dropouts probably 
caused the system to lose synchronization and were not included in the 
distribution. The large jump in the curve in the neighborhood of 145 
bits was contributed entirely by four transcontinental calls. One plausi- 
ble explanation for the occurrence of dropouts of this length is that they 
were caused by echo suppressors. Since the echo suppressors were not 
disabled during transmission, a high-energy noise impulse in the reverse 
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channel could momentarily activate an echo suppressor. This would 
cause a dropout of approximately 145 bits duration. 

The empirical probability distribution function of the number of bits 
between dropouts is shown in Fig. 13. It can be seen that the dropouts 
exhibit some tendency to be bunched together in time. This apparent 
bunching suggests fading rather than other possible causes. 

The error data exhibited some asymmetry. There were about 15 
per cent more 0 — 1 errors than 1 — 0 errors, a result consistent with 
the effect of dropouts. The fact that 1 — 0 errors were slightly more 
prevalent than 0 — 1 errors in calls not containing dropouts supports 
the conclusion that dropouts caused the asymmetry. 

A more convenient distribution for some purposes is the distribution 
of the number of error bits appearing within a block of a given length. 
Following Elliott’s® notation we shall define the function P(m,n) to be 
the probability that exactly m bits will be in error in a block of n bits. 
The functions P(m,n) for n = 10, 15, 21, 28, 31, 68, 115, and 230 are 
plotted in Fig. 14. These curves demonstrate quite vividly the effect of 
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Fig. 12 — Distribution of dropout lengths. 
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Fig. 18 — Distribution of distances between dropouts. 


dropouts. Most of the curves exhibit a local maximum at about n/2. 
This is due mainly to the occurrences of dropouts longer than n. On 
the assumption that 0’s and 1’s were transmitted with approximately 
equal probabilities, dropouts of at least n bits in length would contribute 
a component in the form of a symmetrical binomial distribution to the 
P(m,n) function. This is illustrated in Fig. 15, in which the function 
P(m,81) is plotted with and without dropout components. 

Elliott® has suggested that a good approximate evaluation of the 
performance of a code can be made by assuming that all permutations 
of any given number of error bits in a block are equally likely. Using 
his methods and the function P(m,31) the estimated number of bits 
between undetected errors was calculated to be 8.55 X 10%. As stated 
previously, an average of 9.85 X 10° bits between undetected errors was 
actually observed. This is excellent agreement, although it should be 
remembered that the number of observed undetected errors was too 
small to assure good convergence of the observed average to a true 
average. 

The function P(m,n) changes radically as n becomes very much 
larger. Calls were divided into 1-minute and 5-minute time intervals. 
The cumulative empirical probability distribution functions of the 
numbers of bit errors and word errors occurring within these time inter- 
vals were calculated and are plotted in Figs. 16 and 17. It is interesting 
to note that the distributions for 5-minute intervals are almost identical 
to the corresponding distributions for 1-minute intervals except for a 
scale factor. This suggests that the numbers of errors occurring in suc- 
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Fig. 14 — P(m,n): the probability that exactly m bits will be in error in a block 
of n bits. 
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Fig. 15 — P(m,31), with and without dropout components. 
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Fig. 16 — Distribution of error bits per one- and five-minute time interval. 
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Fig. 17 — Distribution of error words per one- and five-minute time interval. 


cessive intervals as long as a minute are essentially independent. Also, 
there is no noticeable effect of dropouts. 


VI. SUMMARY 


The test demonstrated that it is possible to provide data transmission 
over the switched telephone network with extremely low undetected 
error rates by means of a coding technique of moderate complexity. 

The statistical properties of the error data appear to be similar to those 
observed in other tests. Distributions of bit error rates without regard 
to coding showed a strong similarity to the results of Alexander, Gryb, 
and Nast, despite the fact that different modems operating at different 
speeds were used in the two tests. The digital errors were strongly cor- 
related with each other, and the error rates were highly nonstationary. 
Bit errors were observed to occur in groups of two or three and generally 
had a density of at least one error bit per ten good bits. These small 
groups of errors were themselves bunched together. Dropouts occurred 
frequently in certain calls, but it was difficult to determine their cause. 
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APPENDIX A 


The Error Detecting System and Performance Measuring Apparatus 


A preliminary requirement on the code used for the error contro] 
experiment was that it be capable of detecting approximately 99.9 
per cent of all transmission errors occurring in data transmitted by means 
of a DATA-PHONE data set 201A over switched, long distance tele- 
phone connections. Ease and economy of implementation were other 
factors affecting the selection of a code. Computer studies of a Bose- 
Chaudhur? (31,21) code indicated that this code, subsequently used in 
the experimental system, had the desired error detecting ability. The 
above notation indicates that data were transmitted in blocks 31 bits 
long consisting of 21 information and 10 check bits. 

The code is cyclic with a minimum distance of five* and is therefore 
capable of detecting any four or fewer bit errors in a 31-bit block. 
Furthermore, all single-error bursts} of length 10 bits or fewer, 511/512 
of all 11-bit error bursts and 1023/1024 of all error bursts 12 to 31 bits 
in length are detected. The generator polynomial of this code is h(X) = 
X10 + X7-4+ X*®-+ X + 1, which is equivalent to saying that the code 
is the null space of the matrix: 


1000000000100110101001000011111 
0100000000110101111101100010000 
001000000001 1010111110110001000 
0001000000001101011111011000100 
0000100000000110101111101100010 
0000010000000011010111110110001 
0000001000100111000010111000111 
0000000100110101001000011111100 
e0000n0;001101010I6000111150 | 
0000000001001101010010000111111 


* T.e., every code word (block) differs in at least five places from every other 
code word, 

+ The length of a “‘burst’”’ in this context is the number of bits between and 
including the first and last bits in error in a 31-bit block. 
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Examination of row 10 of the H matrix (in its canonic form) reveals 
that the first check bit transmitted is the modulo 2 sum of information 
bits di, do, ds, ds, ds, ds, dur, dis, dis , dig, and diy. Information bits 
are numbered in the order of their transmission—i.e., d; is the first in- 
formation bit in the block, d; the second, ete. 

The encoder was implemented by the feedback shift register shown in 
Tig. 18,6 which operates as follows. With the feedback path closed 
(i.ec., S in position 1), 21 information bits were shifted from the data 
source into the encoder and simultaneously transmitted. After the 
twenty-first bit had been encoded and transmitted, the feedback path 
was disabled by setting S to position 2, and the contents of the shift 
register (i.e., the 10 check bits) were transmitted. Thus, each block con- 
sisted of 21 information bits transmitted as a group in their original 
order followed by the 10 associated check bits. 

Decoding was accomplished by the same circuit. The decoder was 
synchronized with respect to the received data and thus was able to 
distinguish between information and check bits. To decode a 31-bit 


) > EXCLUSIVE OR 
GATE 
| | SHIFT REGISTER STAGE 






’ Fig. 18 — (31,21) encoder. 
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block the information bits were shifted from the demodulator into the 
encoder, which generated 10 check bits. These were compared to the 10 
check bits received following the 21 information bits. Any difference 
between the two sets of check bits indicated the occurrence of a trans- 
mission error. 

A block diagram of the error detection system is shown in Fig. 19. 
Both the source of data and the reference source are provided by a 
pseudo-random sequence generator. The output of this device is a repeti- 
tive 511-bit sequence containing every 9-digit binary sequence except 
the all-O sequence. Since these data are meaningless so far as information 
content is concerned, a continuous chain of timing pulses shifts both the 
encoder and sequence generator. The output of the source is disregarded 
while check bits are shifted from the encoder. Since 31 and 511 are rela- 
tively prime, all possible 21-bit sections of the 511-bit sequence are 
transmitted as data. In practice the data source must stop after deliver- 
ing 21 bits, while the 10 associated check bits are transmitted. For pur- 
poses of the test this would be impractical, since 511 and 21 have a 
common factor of 7, and therefore only 73 of the possible 511 21-bit 








CLOCK AND 
CONTROL 
CIRCUIT 


CLOCK AND 
CONTROL 
CIRCUIT . 


Fig. 19 — The test system. 
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sequences would be encoded. Timing is provided by the data set 201A, 
which generates 2000 timing pulses per second. The function of the 
clock and control circuit is to operate the encoding circuits at the trans- 
mitting and receiving terminals in the manner described earlier. The 
clock is driven by timing pulses from the data set and produces a periodic 
output signal which is “on” for 21 bits and “off” for 10. This signal is 
used by the control circuit to operate S (see Fig. 18), S1, and 82.* 

At the transmitting terminal S and S81 were set to position 1 while 
information bits were shifted into the encoder and data set, then switched 
to position 2 while check bits were shifted from the encoder. Switches S 
and Sl then were reset to position 1 and the process was repeated for 
the next block of information. 

At the receiver S and S2 were set to position 1 while information bits 
were received and encoded. During this time the received information 
was examined for transmission errors by E1, which produced an output 
whenever a received information bit differed from the output of the 
synchronized reference sequence generator. (The method of synchroni- 
zation will be described in the next paragraph.) After the 21 information 
bits had been received and encoded, S and S82 were switched to position 
2 while the check bits were received. Each of the 10 check bits in a block 
was compared with the output of the local encoder by E2. An output 
signal from E2 indicating the occurrence of detected errors was produced 
whenever a received check bit differed from the corresponding locally 
generated check bit. The outputs of El and E2, the received data, and 
timing information from the clock and control circuit were delivered to 
the recording equipment. 

The clock and sequence generator at the receiving terminal were 
synchronized with respect to the demodulated data by means of the 
synch circuit. This circuit was activated manually by switching 83 
to “synch.” With 83 set to “synch” the phase of the sequence generator 
and clock was automatically shifted by one bit with respect to the re- 
ceived data in response to each output pulse from E1. When the outputs 
from El and E2 were observed to remain constant for one second or 
more, 83 was switched manually to ‘‘run” and the recording equipment 
started. Error data could not be recorded with 83 in the “‘synch”’ posi- 
tion. 

The performance measuring apparatus was located at the receiving 
terminal as indicated in Fig. 19. Signals from E1, E2 and the control 
circuit were combined to indicate the occurrence and type of block errors. 


* Switches 8, 81, and S2 are implemented with solid-state circuits. 
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If, for a given 31-bit block, an error indication was received from E2 
then transmission errors occurred and were detected. Undetected errors 
occurred in a block when errors were indicated in the information sec- 
tion by E1 but not in the check section by E2. 

The received data and the error information derived from the outputs 
of E1 and E2 were recorded on dual-channel magnetic tape. The output 
of the demodulator was transformed into a series of positive and nega- 
tive pulses and recorded on one channel. Following each 31-bit block one, 
two or three framing pulses were recorded on the second channel to 
indicate that the preceding block contained no errors, detected errors or 
undetected errors respectively. The inputs to both channels of the tape 
for each of the three conditions are shown in Fig. 20. Data were recorded 
on channel 1 and block framing on channel 2. In each case only one 
framing pulse follows block K — 1, which is assumed to be error free. 

Cumulative error data for each call were recorded on five electronic 
event counters. The two types of bit errors (0 — 1 and 1 — 0) occurring 
in user information were derived from the output of E1 and recorded 
separately on two counters. A third counter was incremented whenever 
a block was received containing any errors in the information section. 
The fourth and fifth counters recorded detected and undetected block 
errors respectively. The counters were photographed automatically at 
20-second intervals during each call. A clock was included at the camera’s 
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Fig. 20 — Block framing; three cases. 
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field of view and the film dated so that photographic data could be 
correlated with log book records. 


APPENDIX B 


Field Test Procedure 


The purpose of this section is to describe the field test procedure in 
some detail. To review briefly, the receiving error control terminal re- 
mained fixed at Murray Hill or Holmdel and the transmitting terminal 
was carried to the locations listed in Table III. One day was spent at 
each location, during which time data were transmitted over a number 
of dialed connections established from the receiving terminal and used 
for approximately 30 minutes each. 

Before initiating a series of calls between two locations, the line loss 
from the transmitter to the central office was measured with a 12B 
transmission measuring set. The data set’s transmission level then was 
adjusted so that the signal strength at the central office was approxi- 
mately —8 dbm and the data set then placed in the on-hook automatic 
answer mode. The location, telephone number, local loop loss, trans- 
mitting level, and type of central office serving the transmitting terminal 
were recorded in the test log. The encoding equipment was started and 
ran continuously throughout the series of calls (i.e., data were generated 
and encoded continuously and transmitted automatically whenever a 
connection was established from the receiving terminal). 

After the transmitting terminal had been readied for the series of 
calls as described in the preceding paragraph, the receiving terminal 
was attached to a foreign exchange line. A connection was dialed to the 
transmitting terminal, which answered the call automatically and started 
transmitting encoded data immediately. Then the receiving error de- 
tection system was synchronized with respect to the demodulated data 
as described in Appendix A and the recording equipment started. Thirty 
minutes later the recording equipment was stopped and the call termi- 
nated from the receiving end. The data recorded on event counters, 
the times (local) at which the call was started and terminated, and a 
description of any unusual transmission or operating conditions were 
entered into the test log. The recording equipment was reset and the 
receiving terminal attached to a different foreign exchange line for the 
next call. 

The error control equipment, data sets, and error recording apparatus 
were checked periodically throughout the test. This was done to insure 
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that the data collected during the experiment would not be affected by 
marginally operating test equipment. 


APPENDIX C 


Summary of Incomplete Calls 


During the test, 59 calls could not be completed for reasons which 
fall into three general categories. These are: 

(1) long dropouts or fades resulting in loss of synchronism between 
the transmitting and receiving data sets, 

(2) the inability to achieve initial synchronization of the terminals 
within a reasonable length of time, and 

(8) lost connections. 

These conditions will be more fully described in the following para- 
graphs. In addition to these 59 calls, 6 calls were interrupted due to 
human errors and two calls were lost as a result of local power failures. 

Loss of synchronism between the transmitting and receiving data sets 
during otherwise normal communication caused the interruption of 30 
calls. This condition usually was caused by long dropouts, particularly 
on long-haul connections. Dropouts lasting more than approximately 
100 milliseconds generally caused the transmitting and receiving data 
sets to lose synchronism, since timing in the demodulator was derived 
from the line signal. Within 20 milliseconds of a loss of line signal the 
timing reverted to the natural resonant frequency of the high-Q circuit 
in the demodulator’s bit synch recovery circuit. The natural frequency 
of this high-Q circuit was within one cycle of the transmitter frequency. 
Thus, if the modulator and demodulator remained decoupled long 
enough, synchronism between the two was lost. This situation was 
detected easily but resulted in some loss of data, as the terminals had to 
be resynchronized before data transmission could be resumed. Intense 
channel noise was observed to have approximately the same effect as 
dropouts. 

Twenty-one dialed connections were sufficiently noisy that the test 
apparatus could not be synchronized within a reasonable length of time. 
To recapitulate, initial synchronization was obtained by automatically 
shifting the phase of the receiving end clock and sequence generator by 
one bit whenever the synchronization circuit was enabled and a re- 
ceived bit differed from the corresponding locally generated information 
bit. When the system was synchronized, but the synchronization circuit 
was not yet disabled, any transmission errors occurring in the informa- 
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tion section of a block caused the synchronization procedure to be re- 
peated. Therefore, when the channel was unusually noisy the receiving 
terminal would not remain synchronized long enough for the synchro- 
nization circuit to be disabled manually. If synchronization could not be 
established within 2 or 3 minutes after the call was placed the connec- 
tion was dropped and the transmitting terminal called a second time 
using the same foreign exchange line. The semiautomatic synchroniza- 
tion procedure could have been fully automated. Had this been done, 
synchronization might have been achieved over a few of the connections 
for which the semiautomatic method described in Appendix A was un- 
successful. However, since all 21 of these calls were exceptionally noisy 
it appears doubtful that data set synchronism could be maintained for 
a full 30-minute period. 

Eight connections were lost entirely during data transmission and dial 
tone was returned to both terminals. This situation was easily detected 
by the test apparatus. On at least three of these occasions the lost con- 
nections appeared to be associated with telephone maintenance opera- 
tions. 

The conditions described above are transmission impairments which 
cannot be integrated directly into the error rate data. These are, how- 
ever, situations with which the data communicator must contend and are 
included here to provide an estimate of their frequency of occurrence. 
These data are of importance in the design of error control systems which 
must recognize such transmission impairments and allow for some type of 
remedial action to be taken, such as manual intervention or automatic 
resynchronization. 
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A Simulation Study of Routing and 
Control in Communications 
Networks* 


By J. H. WEBER 
(Manuscript received March 23, 1964) 


A set of studies has been undertaken to develop guidelines for the design 
and operation of communications networks with automatic alternate routing. 
Comparisons are made of engineered costs and overload capability of net- 
works using several alternate rouling configurations, and employing a num- 
ber of different operating and control procedures. The traffic model selected 
consists of a 34-node network abstracted from the U.S. telephone toll network, 
with basic load levels obtained from field data. The overload evaluations were 
made using a simulation program prepared for the [BAI 7090 computer. 


I. INTRODUCTION 


In a recent paper! the results of some preliminary comparisons of two 
alternate routing configurations for communications networks were re- 
ported. Those results indicated that for small networks (six or fewer 
nodes) with low traffic densities a symmetrical or unrestricted routing 
pattern is superior to a hierarchy similar to that in use in the U.S. toll 
network, while for higher traffic densities there appeared to be little 
difference in the network behavior in terms of economy and reaction to 
overloads. 

Subsequently, a new simulation program has been constructed? and 
substantially larger networks have been examined to provide a more 
meaningful guide to network design under various circumstances of 
geography and load level. An additional configuration, called the ‘“‘gate- 
way,’ as well as several operating and control variations, has been 
examined. The latter include stage-by-stage operation with and without 
crankback (return of routing control to a previous node for rerouting 
when blocking is encountered at an intermediate switching point), limita- 
tion of number of links per call in symmetrical networks, and trunk 
reservation for first-routed traffic only. 


* Presented at the Fourth International Teletraffic Conference, London, July, 
1964. 
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The results show that: 

(1) There is little difference in network cost or overload capability 
between hierarchical and symmetrical networks at the load densities 
considered. 

(2) The gateway network (a two-level hierarchy with no interregional 
high-usage groups) requires substantially more trunking and switching 
than either the hierarchical or symmetrical networks and shows no sig- 
nificant difference in overload performance. 

(3) Restriction of alternate routing in symmetrical networks improves 
performance at all levels of load. 

(4) The use of crankback is a disadvantage for symmetrical networks 
with a high traffic density, at all levels of overload. It offers a slight ad- 
vantage for symmetrical networks with lower traffic intensities and does 
not appear to have any significant effect on the performance of hierarchi- 
cal networks. 

(5) Trunk reservation for first-routed traffic on a dynamic basis im- 
proves the performance of almost all networks examined, for all load 
conditions, and displays no detrimental effects. 


Il. THE SIMULATION 


The simulation program used in these studies is described in Ref. 2. It 
has many of the capabilities of the program described in Ref. 1, but 
has been reprogrammed to accept networks with heavier loads and to 
operate more efficiently. A number of additional features have also been 
provided. 

The program is basically capable of simulating networks of up to 63 
nodes, with arbitrary alternate routing patterns and stage-by-stage call 
forwarding. There is no congestion or delay allowed at switching points, 
all congestion being assumed due to trunk shortages. Calls which fail to 
complete initially may be abandoned with a fixed probability or retried 
after a constant or exponentially distributed interval. Any prespecified 
number of trunks can be reserved for first-routed traffic only, and calls 
may ‘‘crank back” or return to a prior node if blocked at some point in 
the network. The maximum-size network which can be accommodated 
is largely determined by the number of simultaneous calls in progress, 
which may have a maximum of about 6000. Traffic loads are specified 
on a point-to-point basis, with arbitrary proportions in each direction, 
and may be changed linearly at any time during the run. That is, mean 
arrival rates can change linearly in time during the run at any rate and 
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between any bounds. (Another modification of the program allows the 
use of two priorities of traffic and mixtures of direct and store-and-for- 
ward traffic, with trunk reservation by traffic type and priority. This 
version was not used for the studies described herein, however.) 

In order to accommodate larger networks more efficiently, the program 
was written in several sections. The first of these accepts the basic load 
inputs (mean point-to-point loads and holding times) and generates call 
arrival times and holding times, which are then stored on magnetic tape. 
This tape is then used as input to the simulation program, which proc- 
esses the calls through the simulated system and prints out raw data on 
trunk utilizations and call histories on two magnetic tapes. These tapes 
are presented to the output processor programs, which provide the ap- 
propriate reduced outputs. 

For convenience in preparing the input data, the main section of the 
program has been arranged to determine its own routing for symmetrical 
and hierarchical networks, given the numbers of trunks and the distances 
for symmetrical networks, or the homing arrangements for hierarchical 
networks. 

The output statistics are reported at prespecified time intervals, and 
these subinterval results may then be used as samples for a final output 
containing both means and standard deviations of all relevant quantities. 
The quantities which are printed out are as follows: 

(1) point-to-point traffic loads at the end of the run (input data). 

(2) routing tables for all point-to-point traffic items. 

(3) means and standard deviations of the following measured quanti- 
ties for each point-to-point traffic item: 

(a) blocking probability 
(b) average delay and distribution of delays for retried calls 
(c) average number and distribution of number of links per call. 

(4) means and standard deviations of the following measured quanti- 

ties for each trunk group (obtained by switch count measurements) : 

(a) number of trunks present in each group (input data) 

(b) number of trunks reserved for first-routed traffic in each group 
(input data) 

(c) total carried load in erlangs on each group (and per cent occu- 
pancy) 

(d) first-routed carried load on each group (and per cent of total) 

(e) alternate routed carried load on each group (and per cent of total). 

(5) means and standard deviations of measured over-all network 
quantities as follows: : 
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(a) over-all average blocking probability, B, given by 


_ D> ai; Bi; 
paw 


> Ai; 
tg 
where 


a;; = offered load between nodes 7 and j, and 
B;; = blocking probability of calls offered between nodes 7 and 7 


(b) average number and distribution of number of links per call 
(c) weighted average delay and delay distribution for retried calls 
(d) total number of trunks in the network (input data) 
(e) total trunks reserved for first-routed traffic (input data) 
(f) total carried load and over-all occupancy 
(g) total carried first-routed load 
(h) total carried alternate routed load. 
(6) the “space dispersion,’ D, of the blocking probability, delay 
distribution and links per call distribution, given by 


Dz = (d ayBi?/ 2, ai;) — B} 


for the blocking probability, and similar expressions for the other quanti- 
ties. This serves as a measure of the variation in grade of service pro- 
vided to various traffic items in the network depending upon their origin 
and destination. 
(7) the following input parameters: 

(a) maximum number of links aJlowed per call 

(b) number of stages of crankback allowed (That is, the number of 
steps a call is allowed to back up before progressing forward after having 
reached a point of congestion.) 

(c) percentage of calls to be retried 

(d) retrial time distribution and mean value 

(e) holding time distribution and mean value 

(f) number of nodes 

(g) number of reporting intervals and their lengths 

(h) number of reporting intervals to be collected for final processing 

(i) routing pattern (hierarchical or symmetrical) 

(j) interval between switch counts for determination of trunk in- 
formation. 

The simulation runs quite rapidly, processing about 500,000 calls per 

hour using the IBM 7090 computer for a 34-node network with a total 
load of about 5000 erlangs, excluding the traffic generation. If traffic 
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generation time is included, the processing rate drops to about 375,000 
calls per hour. If several networks are evaluated using the same traffic 
input, as was done in these studies, however, the traffic need be generated 
only once, and the same tape can be used as input to the simulation any 
number of times. Several dozen simulation experiments were made for 
the studies described below, but only eight traffic tapes were generated. 


II. GENERAL PROCEDURE 


The evaluation procedure encompasses the following steps: 

(1) select a geographical area, including switching center locations; 

(2) select a basic trafic model on which to base network engineering; 

(83) engineer networks to a given grade of service using each of the 
routing procedures to be considered; 

(4) determine the costs of each of the networks so engineered; 

(5) change the loads to correspond to reasonable patterns of overload 
or shifting load; 

(6) using simulation, measure the performance of each of the networks 
under the load changes used in step (5); 

(7) repeat steps (5) and (6) for each of the control and operating 
variants considered. 


These steps will be described in detail in the following sections. 


3.1 The Geographical Region 


Although it is not possible to select a geographical region (or regions) 
which will be typical of all situations, it is desirable to find an area which 
at least has the capability of accommodating a sufficient number of nodes 
to adequately exercise the various routing patterns to be examined and 
of reacting to realistic load fluctuations. The region should also contain 
both densely and sparsely populated sectors, which to some extent must 
exist in all real networks. (A uniform or arbitrarily variable traffic dis- 
tribution would probably not be a valid test, since actual telephone 
traffic varies with the population density, higher-density areas having 
large amounts of traffic within and among them, and sparsely populated 
areas being lower in traffic to all destinations.) Since the geographical 
region will utimately require a traffic model to be superimposed upon it, 
an area for which actual traffic data is obtainable is again more likely to 
represent reality than one for which traffic quantities need to be invented. 

A single region which appears to meet most of the criteria specified 
above exists on the West Coast of the U.S.A. The states of California, 
Washington, Oregon and Nevada are almost entirely administered by 
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two local telephone operating companies for toll purposes and represent 
a region which ranges from sparsely populated areas such as Nevada and 
eastern Oregon to sections such as Southern California, which contains 
the Los Angeles and San Diego metropolitan areas. lig. 1 is a map of 
this region, showing the 34 switching centers used. Although there are 
many more than 34 toll switching offices in this region, only the control 
switching points (CSP’s), which make up the offices of the top three 
levels of the U.S. toll network hierarchy,’ are included. (Las Vegas, 
though actually a toll center, is assumed to be a primary center.) All 
traffic which both originates and terminates in the region, however, is 
included, as will be discussed below. 
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Fig. 1 — U.S. West Coast traffic region. 
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3.2 The Basic Traffic Model 


The basic traffic model used for engineering the various networks was 
developed from actual message records of the Pacific Telephone and 
Telegraph Company and the Pacific Northwest Bell Telephone Com- 
pany. These records include the total messages for a period of ten con- 
secutive business days during June 1962. They provide total messages 
and message minutes from every toll switching center in the area to 
every other. Traffic which originates or terminates in other than the 
four-state area is not considered, nor is traffic which is not carried on the 
toll (or long distance) network. Traffic originating and/or terminating 
at offices of connecting companies, but which is carried on the toll net- 
work, is included. 

In order to obtain a busy hour traffic base from the ten-day records, 
it was assumed that ten per cent of the total traffic was offered during 
each day and ten per cent of the day’s traffic was presented during the 
busy hour. Therefore the busy hour traffic load was assumed to be one 
per cent of the total ten-day message load. 

Traffic between toll centers of the fourth rank, which are not explicitly 
included in the 34-node model, is handled in two ways, giving rise to two 
networks with different traffic densities. In the first of these, called the 
‘full-traffic” or “full-load” network, all traffic between toll centers is 
added to that between the centers on which they home. Traffic between 
toll centers homing on the same control switching point is eliminated. 
For example, referring to Fig. 2, traffic between toll centers A and B is 
added to the traffic between control switching points D and FE, as is the 
traffic between A and E, and between B and D. Traffic between toll 
centers B and C and between points A and D, and B and E is eliminated. 

In the second network, called the ‘“‘reduced-traffic” or “‘reduced-load”’ 
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Fig. 2— Disposition of toll center traffic. 
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network, it is assumed that some of the traffic between toll centers is 
carried on high-usage groups, and only the overflow is carried on the 
groups between the CSP’s. Therefore traffic between toll centers such 
as A and B, in which there may be two routes before the CSP-CSP trunk 
group is reached, is assumed to overflow only 10 per cent to the D-E 
group. Traffic between a toll center and a CSP, such as that between A 
and I, may have only one route before the CSP-CSP route is reached, 
and 20 per cent of this traffic is then assumed to be offered to route D-E. 
Traffic between toll centers such as B-C, or between toll centers and the 
CSP on which they home, such as A-D, is again eliminated. 

The net effect of these assumptions is to develop a total network load 
of 4764 erlangs for the full-load case, and 2031 erlangs for the reduced- 
load network. The maximum point-to-point load for the full traffic net- 
work is 158 erlangs, and the maximum load originating and terminating 
at any node is 848 erlangs. The minimum point-to-point load is 0.01 
erlang, and the smallest node has 26 erlangs originating and terminating 
at it. For the reduced network the maximum point-to-point load is 84 
erlangs and the minimum is zero. The total traffic originating and termi- 
- nating at the largest node is 288 erlangs, and at the smallest is 15 erlangs. 
A tabulation of the total loads originating and terminating at each point 
in both networks is given in Table I. 


3.3 The Network Configurations 


Five specific networks of three configuration classes for the full-traffic 
model and two networks in two classes for the reduced-traffic model 
were examined. The first class of networks is hierarchical in structure, 
similar to that in use in the Bell System toll network. In these networks, 
trunk groups are defined as high-usage, which may overflow traffic to 
alternate routes, or final, which may not. The apportionment of trunks 
among high-usage and final routes is decided on an economic basis.* 
Both two- and three-level hierarchies were examined in the full traffic 
model, while only two levels were used for the reduced traffic case. The 
routing for these networks is shown in Fig. 3. 

In Fig. 3(a) the basic elements of a two-level hierarchy are shown. 
Calls from node 1 to node 2 will, if unable to use the direct route, attempt 
to reach node 4, from which the only allowable choice is the final route 
4-2. If unable to reach node 4, calls will then attempt to reach node 8, 
from which they will attempt the direct route 3-2, finally overflowing to 
the final route 3-4. Calls from 2 to 1 will reverse the procedure, attempt- 
ing to reach node 3 and overflowing to node 4. Calls initially routed 
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TaBLE I—SwircHIne CENTER LOADS 





Total Originating and Terminating Traffic 











In Erlangs 
Switching Center 
Full Load Reduced Load 
Bellingham, Wash. 94.17 34.81 
Seattle, Wash. 533.25 272.59 
Spokane, Wash. 141.89 71.96 
Yakima, Wash. 175.17 59.73 
Astoria, Oregon 26.92 16.87 
Bend, Oregon 26.65 15.19 
Klamath Falls, Oregon 33.12 19.60 
Medford, Oregon 62.42 37.08 
Pendleton, Oregon 51.35 21.11 
Portland, Oregon 468 .05 233.56 
Roseburg, Oregon 37 .37 17.85 
Las Vegas, Nevada 116.74 82.04 
Reno, Nevada 138.21 69.44 
Fresno, Calif. 306.11 140.81 
Modesto, Calif. 132.55 81.86 
Stockton, Calif. 206.21 105.38 
Redding, Calif. 87.94 30.88 
Sacramento, Calif. 539 .32 173.63 
San Jose, Calif. 459.24 188.90 
Oakland 4M, Calif. 848 .22 288 .14 
Oakland Fr., Calif. 524.25 194.06 
Palo Alto, Calif. 251.14 108.99 
San Francisco, Calif. 375.40 198.45 
San Rafael, Calif. 134.12 40.26 
Santa Rosa, Calif. 386.08 107.64 
Bakersfield, Calif. 167.84 93 .56 
San Luis Obispo, Calif. 109.15 45.59 
Compton, Calif. 447 .04 190.15 
Los Angeles, Calif. 699.27 242.63 
El Monte, Calif. 361.88 179.54 
Van Nuys, Calif. 426.75 174.32 
Anaheim, Calif. 158.45 73.78 
San Bernardino, Calif. 576.81 202.72 
San Diego, Calif. 424 33 248.71 
Total (orig. plus term.) 9527.41 4061.54 








along the final route chains, such as those from 1 to 3, have only a single 
choice of route. : 

The three-level network, shown in Fig. 3(b), allows a somewhat more 
complicated routing pattern. Calls from 1 to 2 in this network will at- 
tempt to reach nodes 2, 4, 6, 5 and 3 in that order, and all other routes 
will follow a similar pattern of hunting from low to high level in the 
distant region, and from high to low level in the home region. In no event 
can a call use more than one interregional trunk, and calls always travel 
up the hierarchy in the home region and down in the distant region. A 
fuller description of the process is given in Ref. 3. This restrictive routing 
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Fig. 3 — Organization of hierarchical networks. 


pattern allows alternate routing to proceed without fear of ‘‘ring-around- 
the-rosie”’ or “shuttling” (which are types of looping routes), even 
though no information is carried with the call other than its destination 
code. The two-level hierarchical networks actually used contained six 
higher-level offices, located at Seattle, Portland, Sacramento, Oakland, 
San Bernardino, and Los Angeles. The three-level network took Port- 
land, Sacramento, and San Bernardino as highest-level, or regional, 
centers; leaving Seattle, Oakland, and Los Angeles as middle-level, or 
sectional, centers. A sketch of the Washington-Oregon section of the 
full-load, two-level hierarchy is shown in Fig. 4. 

The other network configuration examined for both load levels is the 
symmetrical network, in which alternate routes are selected approxi- 
mately according to their total length. In all such networks studied, 
trunks are arbitrarily eliminated on links with less than 2 erlangs of 
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directly offered traffic, and routing is then established using equipped 
links. Fig. 5 shows the trunk group layout for the Washington-Oregon 
section of the full-load symmetrical network. A basic restriction in all 
networks is that at most five outgoing choices are allowed from any node 
to any other, it being considered that further choices would lead to 
excessively circuitous routes. In addition, no route is allowed which is 
more than 1.5 times as long as the shortest nondirect route, or exceeds 
the shortest nondirect route by more than 2 links. These numbers were 
arrived at by trial and error and produced the most economical network 
for the full-load case, although they were not very critical in the deter- 
mination of network cost or capability. Two symmetrical networks are 
studied in the full-load case, one which matches the blocking performance 
of the other networks at engineered loads, and one which has a higher 
blocking, as described below. Only one symmetrical network is used for 
the reduced-load model. 

The method by which routes are selected is as follows. Initially, the 
shortest route between each two points is found. The route to the nearest 
neighbor node on this route is then listed as the first-choice route. The 
link from the originating node to the nearest neighbor node along the 
first-choice route is then made ineligible, and the shortest route again 
found. The link to the nearest neighbor node along this route is then 
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Fig. 4 — Full-load, two-level hierarchy — Washington-Oregon. 
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Fig. 5 — Full-load symmetrical network — Washington-Oregon. 


denoted the second-choice route, and the distance and number of links 
calculated and compared with the first nondirect route. The entire pro- 
cedure is repeated until no route falls within the distance ratio and link 
difference criteria, or five routes are selected, whichever occurs first. At 
this point the process is terminated and the routing table established. 
For example, in Fig. 5, to go from Yakima to Medford, the first-choice 
route is via Portland, the second is via Seattle, and the third is via 
Pendleton. 

The third network configuration, considered in the full traffic case 
only, is the gateway network. This is essentially a two-level hierarchy 
with the interregional high-usage groups removed, as shown in Fig. 6 
for Washington and Oregon. Traffic and trunks are therefore concen- 
trated along the access routes to the gateway switching center and on the 
interregional finals. Although this kind of system clearly requires more 
trunks and trunk miles than a hierarchy to carry the same loads, it has 
been conjectured that savings in line and terminal equipment could be 
effected because of the large trunk cross sections involved. It also has been 
thought that this scheme might provide improved performance under 
shifting loads, which hypothesis is examined in this study. The gateway 
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network studied assumed the gateway switches to be located at the same 
points as the higher-level offices in the two-level hierarchical networks. 

In all of the above networks, stage-by-stage routing similar to that in 
the U.S. toll network is used. That is, once a call has reached a certain 
point in its path, its route selection is independent of its past history, 
and it is unable to back up and find another route out of a prior node. 
(This is not true if crankback is allowed, as will be discussed later.) In 
the symmetrical networks, the previous route is considered to the extent 
of preventing a call from returning to a node through which it has al- 
ready been switched. In the hierarchy and gateway, this restriction is 
implicitly provided by the logic of the routing structure. 

In sum, the networks examined are as follows: 


(1) Full-load model 


(a) two-level hierarchy 

(b) three-level hierarchy 

(c) symmetrical 

(d) symmetrical with high blocking 
(e) gateway. 
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Fig. 6 — Gateway network — Washington-Oregon. 
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(2) Reduced-load model 


(a) two-level hierarchy 
(b) symmetrical. 


3.4 Engineering Procedures 


The size and complexity of the networks considered are such that 
manual engineering procedures or trial and error methods are not feasi- 
ble. Accordingly, computer programs were prepared which established at 
least an initial network, which could then be adjusted if required by 
trial and error using the simulator. The objective for all networks was to 
attain an over-all average blocking probability of 0.01, with as small a 
dispersion of individual point-to-point probabilities as possible. This is a 
somewhat different criterion than the one normally used in existing 
hierarchical alternate routing systems, which specify the blocking proba- 
bility observed on the final route, but it is closer in philosophy to local 
systems and others in which blocking probabilities produced by the 
system are the same to all customers. 

The hierarchical networks were engineered with the aid of a computer 
program which essentially follows the procedure outlined in Ref. 4. 
Using this method, traffic is transferred from the direct to the alternate 
route when the direct route becomes so inefficient that the cost of adding 
a trunk to it is more than the cost of carrying the traffic on the alternate 
route. No account was taken of the nonrandomness of overflow traffic® 
or of the nonindependence of different links in the network. The errors 
resulting from these assumptions were not large and were corrected where 
required by trial and error using the simulator. 

The process for engineering a symmetrical network is less well de- 
veloped, and no method for designing an optimal, or even necessarily a 
very good network, exists. However, a program is in existence® which is 
capable of designing networks which will closely meet a desired blocking 
probability, using prespecified routes which are fully determined from 
origin to destination. It was necessary, in order to use this program, to 
convert the shortest route procedure described in Section 3.3 above to 
one which provides the full route rather than simply the order of hunt 
over the adjacent nodes. This resulted in networks which were engineered 
using a slightly different routing arrangement than the simulator ac- 
tually used, and this, in conjunction with the basic assumptions implicit 
in the engineering program of random overflow traffic and independent 
links, led to blocking probabilities in the final network which were some- 
what higher than desired. These were corrected for the purposes of 
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comparing network configurations, but for certain studies of various 
methods of operating a symmetrical network, the networks with high 
blocking were retained. 


3.5 Load Changes 


Three patterns of load changes were used to measure the performance 
of the various networks under shifting load conditions. The two load 
changes examined in Ref. 1, uniform overload and overload of all traffic 
to and from a particular node, were avoided because of their limitations. 
The first case, uniform overload, represents a situation which is thought 
not to ordinarily occur in large real systems, and both load models are 
likely to obscure differences in behavior of competing networks, since 
the networks tend to be completely saturated or are limited by the 
specific overloaded nodes. Instead, three patterns of shifting loads, in 
which the total offered network load remained approximately unchanged, 
were used. 

The first of these, called the “Christmas load,” represents a type of 
shifting load normally seen in the U.S. on Christmas Day and on a few 
other special occasions. On these days, the normal long distance business 
traffic disappears and is replaced by a large volume of residential traffic. 
Typically, the increased traffic is of substantially longer haul than is 
the normal day traffic, so the phenomenon observed is that short-haul 
traffic decreases, but long-haul traffic increases. In order to represent this 
in the sample Pacific network, the network was broken down into four 
areas, consisting of Washington, Oregon, Northern California and 
Southern California. (Northern Nevada was included with Northern 
California and Southern Nevada with Southern California.) All intra- 
area traffic was reduced to 60 per cent of its normal value, and inter- 
area traffic was increased to from 150 to 275 per cent of its normal 
value, depending upon the distance. The total network load was 94 per 
cent of its normal value, as shown in Table IJ. Although these changes 
may appear extreme, they are not thought to be out of line with what 
actually occurs in the U.S. on Christmas and were applied to both full 
and reduced traffic models. 

The second load change examined is not typical of any actual situa- 
tion, but was designed to evaluate the effectiveness of the various net- 
works in shifting load from an overloaded trunk group to a simultane- 
ously underloaded one. In order to do this for the full-load network, 
all traffic items originating or terminating at the Oakland 4M machine, 
the largest office in the network (83 traffic items, total load 848 erlangs 
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TABLE IL—CuristmMas LOAD CHANGES 











Traffic % of Normal Day Busy Hour Load 
Intraregional 60 
Washington-Oregon 150 
Washington-Northern Calif. 210 
Washington-Southern Calif. 275 
Oregon-Northern Calif. 175 
Oregon-Southern Calif. 230 
Northern Calif.-Southern Calif. 150 
Network 94 





representing about 20 per cent of.the network load), were either halved 
or doubled at random, although this was slightly modified so that the 
total network load remained at 99.5 per cent of its normal value. The 
normal and modified values of the Oakland loads are shown in Table III. 
In the reduced traffic model no single office had enough traffic to cause 
substantial changes in total network performance, so the halving and 
doubling were done at Seattle, Oakland and Los Angeles, which have 
total loads of 767 erlangs, representing about 40 per cent of the total 
network load. In this case the total network load increased by about 8 
per cent. It is to be emphasized that this set of loads does not represent 
any expected realistic situation, but is a completely artificial test of the 
effectiveness of automatic rerouting under most favorable conditions. 

The third load change examined was actually a series of load changes 
based on an assumed movement of the busy hour from north to south 
during a four-hour period. It was further assumed that, relative to the 
busy hour, an area’s load was reduced 5 per cent in the adjacent hour, 
10 per cent in the second hour, and 20 per cent in the third. Traffic be- 
tween two areas (defined in the same way as for the Christmas loads) 
was taken to be the arithmetic mean of the levels of the terminal offices. 
That is, a traffic item between an area which is in its busy hour and one 
two hours distant is assumed to be reduced 5 per cent from its busy hour 
value. 

Since the networks were engineered based on a single over-all load 
value, some normalization was done so that the over-all network load 
remained approximately constant. There were also some limitations in 
the simulation which prevented the desired changes from being reached 
exactly, but the final loads were quite close to the desired value. The 
sequence of changes is shown in Fig. 7. The first part of the line repre- 
sents the basic engineered load, followed by the changes in each area’s 
traffic as shown. The ordinate is a relative scale, so all loads are given 
as multiples of the basic value. The ramps between the hours were 


ROUTING AND CONTROL 


2655 


actually simulated as shown, but no measurements were taken during 
these periods. The inter-area traffic levels are not shown, but are arith- 
metic means of the levels of the terminal nodes, as described above. 
This load change, which was applied to both full- and reduced-load 
networks, is designed to analyze a situation similar to that in the entire 
U.S.A., which has several time zones with a different busy hour in each. 
Although such differences arc, of course, not actually observable in the 
network selected, which runs essentially north and south, we can, for 
the purposes of modeling, assume it runs from east to west, and is ex- 
panded in its dimensions. In this case, time zone changes like those 
postulated for this ‘“‘busy hour load” would in fact be observed. 


Tasie II] —Oaxianp Variation Loaps (Fuut-Loap NEetworks) 





Traffic between Oakland 4M and 


Traffic Loads in Erlangs 





Normal Loads 


Changed Loads 





Bellingham, Wash. 
Seattle, Wash. 
Spokane, Wash. 
Yakima, Wash. 
Astoria, Oregon 
Bend, Oregon 
Klamath Falls, Oregon 
Medford, Oregon 
Pendleton, Oregon 
Portland, Oregon 
Roseburg, Oregon 
Las Vegas, Nevada 
Reno, Nevada 
Fresno, Calif. 
Modesto, Calif. 
Stockton, Calif. 
Redding, Calif. 
Sacramento, Calif. 
San Jose, Calif. 
Oakland Fr., Calif. 
Palo Alto, Calif. 
San Francisco, Calif. 
San Rafael, Calif. 
Santa Rosa, Calif. 
Bakersfield, Calif. 
San Luis Obispo, Calif. 
Compton, Calif. 

Los Angeles, Calif. 
El] Monte, Calif. 
Van Nuys, Calif. 
Anaheim, Calif. 

San Bernardino, Calif. 
San Diego, Calif. 


Total 





3.58 
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Fig. 7 — Busy hour load changes. 


3.6 Evaluation Criteria 


A communications network which must be engineered to meet a 
specific set of demands for service without being excessively costly, and 
which will then be subjected to demands for which it was never designed, 
is not easily evaluated by a single figure of merit, or even by a small 
number of parameters. The weight of overload performance versus engi- 
neered economy, performance under overload A as opposed to that 
under overload B, and service to traffic between points i and j as opposed 
to that provided between points k and m, provide ample opportunity 
for conflicting requirements. This is, of course, in addition to nontraffic 
considerations such as survivability, ease of engineering, administration 
and control, or ability to provide other services such as data and private 
line. 

Nevertheless, in order to make a comparative evaluation of various 
network configurations, a set of criteria must be adopted which can be 
evaluated for each network under study and which will reflect the basic 
considerations of cost and service quality under all conditions. 

The criteria which have been selected are four in number, two relating 
to cost and two relating to grade of service. They are: 
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(1) number of trunks required to provide the desired grade of service 
[of P(0.01)] at engineered load 

(2) number of trunk miles required to provide the desired grade of 
service at engineered load 

(3) the over-all weighted average blocking, B, as defined in Section II 
above 

(4) the dispersion of blocking Dz (subsequently denoted simply D), as 
defined in Section II above. 


Items (1) and (2) above can be provided with costs to derive approxi- 
mate network costs, which will vary depending upon the cost of switch- 
ing, terminal equipment and line facilities. This has been done for a few 
typical costs. The costs so derived are approximate because the trunk 
miles are defined as point-to-point airline miles, which is not the way 
actual facilities would normally be routed. 

Items (3) and (4) are measures of service quality. B is by itself a meas- 
ure of over-all network performance, and it is directly related to carried 
load. However, there may be severe distortions in the point-to-point 
blockings which would yield a low B but might still leave certain cus- 
tomers with extremely poor service. The inclusion of D as a criterion 
will help to identify such a situation and ensure that network service is 
evaluated on a basis of balance as well as blocking level. 


3.7 Operating and Control Procedures 


The variations in operating procedures and the control methods em- 
ployed all have the effect of changing the amount of alternate routing, 
normally also making different numbers of routes available to various 
point-to-point traffic items. Two control procedures were investigated 
for both the hierarchical and symmetrical networks. These were one- 
stage crankback and trunk reservation for first-routed traffic only (sub- 
sequently referred to simply as trunk reservation). One-stage crankback 
allows a call which has reached a point from which it is unable to proceed 
to back up one link along its previous route and attempt to complete via 
another route. This has been proposed as both a traffic improvement 
measure and as a means for allowing machine troubles to be circum- 
vented without customer retrials. The investigation here relates, of 
course, only to its effect on the traffic capacity of the network. Trunk 
reservation allows only first-routed traffic to seize the last idle trunk in 
a group. Alternate routed calls can be served only if at least m + 1 
trunks are idle, where m is the number of trunks reserved. This procedure 
tends to maximize the number of calls which are carried on direct links 
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at the expense of those carried on alternate routes. It also reduces group 
efficiencies somewhat, and the question is whether the reduction in 
circuitous routing is enough to compensate for this. 

Finally, for symmetrical networks only, the maximum number of 
links per call was varied. In the case called “full routing,’’ a maximum 
of five links was allowed for any call in the network. In the case called 
“limited routing,” a maximum of only three or four links per call was 
allowed, depending upon the connectivity between the originating and 
terminating points of the call. This restriction, of course, reduces the 
average number of links per call, at the same time reducing the number 
of routes possible between any two points. 


IV. ANALYSIS OF NETWORK CONFIGURATIONS 


4.1 Facility Requirements 


It is difficult to arrive at an accurate measure of the cost differential 
between the various network configurations, since costs of trunk termi- 
nations, switching, and trunk lines vary from place to place and from 
network to network. However, it is expected that the relative costs of 
the various network configurations can be deduced from the number of 
trunks and the number of trunk miles by applying appropriate factors 
related to the distribution of trunk lengths and the types of switching 
and transmission equipment in general use in any given situation. If the 
unit costs of switching equipment, or of control features inherent in the 
routing plan, are significantly different for different networks, the mag- 
nitudes of these differences can be balanced against the differences in 
trunks and trunk miles to again deduce the total network relative costs. 
It should also be noted that the distances used for the trunk length calcu- 
lations are based on airline mileage between originating and terminating 
points, which is ordinarily somewhat shorter than actual facility route 
mileage. This discrepancy can be corrected by introducing multiplying 
factors when determining network costs for any actual case. 

Table IV shows the number of trunks and trunk miles required to 
provide the noted grade of service for each of the networks under con- 
sideration, both in absolute value and as per cent difference from the 
two-level hierarchy, which was arbitrarily selected as the standard. Al- 
though the blocking probabilities are not exactly the same for all net- 
works due to inaccuracies in the engineering procedures and statistical 
fluctuations in the simulations, they are quite close. 

The differences in facilities required for the various networks, with the 
exception of the gateway, are quite small, amounting to at most 4.1 
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TaBLE ITV — CoMPaARATIVE TRUNKING REQUIREMENTS 





(a) Full-Load Networks 





























Trunk Miles : Trunks 
B 
Network i 
hp Actual % Diff. from Actual % Diff. from (Engineered) 
(000) 2-Level Hier. chu 2-Level Hier. 
2-level hier. 1174 0 6659 0 0.007 
3-level hier. 1154 —1.7 6679 +0.3 0.008 
Symmetrical 1129 —3.8 6727 4-L..0 0.007 
Gateway 1268 +8.8 9236 +38 .6 0.010 
(b) Reduced-Load Networks 
Trunk Miles Trunks 
B 
Network F 
i Actual | % Diff. from} a ctuay  (%_ Diff. from engineered) 
(00) — |2-Level Hier. crus 2-Level Hier. 
2-level hier. 6047 0 3298 0 0.008 
Symmetrical 5801 —4.1 3256 —1.3 0.008 





per cent difference in trunk miles and 1.3 per cent difference in trunks 
between the symmetrical and hierarchical reduced-load networks. 

The gateway network requires a much larger number of trunks and 
trunk miles than any of the others, reflecting the fact that many calls 
which in the other networks require only one link must use three in the 
gateway, and the fact that there is much excessive routing, or “back- 
haul” in traffic which is obliged to switch through gateways. In this 
case the resulting cost difference represents the savings in switching and 
line costs which would have to be achieved to offset the increased quanti- 
ties of equipment required. 


4.2 Costs 


Table V gives the costs of the various networks, assuming a range of 
ratios of line to terminal costs which should include most actual situa- 
tions. The differences between the hierarchical and symmetrical networks 
are quite small, as is that between the two- and three-level hier- 
archies, leading to a tentative conclusion that in these cases cost differ- 
ential is not a primary reason for selection of one network over another. 
It must be remembered, however, that the hierarchy (two-level) was 
engineered using a known and proven economical procedure, while no 
such method is available for the symmetrical networks. Therefore, the 
hierarchies are probably close to optimal, while some additional economies 
might ultimately be realized for the symmetrical networks. 
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TABLE V — Network Costs 











(a) Full-Load Networks 





Using $1500 Trunk Termination and Switching Costs 
































Cost/Trunk Mile 
$10/Trunk Mile $50/Trunk Mile $100/Trunk Mile 
Cc > Diff. f C 4 Diff. f C Diff. f 
Network $ (000,000) Meevel ten $(000,000) et ee $000,000) Pretti Hee 

2-level hier. 21.72 — 68.69 — 127.39 aor 
3-level hier. 21.56 —0.74 67.72 —1.34 125.42 —1.55 
Sym. 21.38 —1.57 66.54 —3.13 122.99 —3.45 
Gateway 26.53 +17.54 17.25 +12.46 140.65 +10.41 

(b) Reduced-Load Networks 

2-level hier. 10.99 — 35.18 — 65.42 — 

Sym. 10.69 —2.73 33.89 —3.67 62.89 —3.86 





Using the $50 per trunk-mile line cost figure, the data shown in Table 
V(a) indicate that there is about a 1.3 per cent savings in cost of trans- 
mission and switching facilities for a network of this size with full loads 
when a three-level rather than a two-level hierarchy is used, and another 
1.8 per cent if a symmetrical network is considered. This, of course, does 
not include any differences in signaling and control equipment which 
might be required to implement a symmetrical network, nor can it take 
account of the nonoptimality of the network engineering procedure now 
in use. 

The gateway network, as expected, costs about 12 per cent more than 
a hierarchy to carry the same traffic, assuming that trunk and terminal 
costs are the same for all networks. This difference will be somewhat 
mitigated by the fact that trunk and facility routes are more likely to 
be identical in the gateway than in the hierarchical configuration, and 
therefore the multiplier to convert from airline miles to facility miles 
may well be smaller. In addition, any savings in switching costs which 
can be effected because of the large volumes of traffic flowing through 
the gateway switches will of course work to the advantage of the gateway 
plan. 

Table V(b) indicates that the reduced-load symmetrical network is 
about 3.7 per cent less expensive than the hierarchy. This is in agreement 
with earlier results! which indicated that lightly loaded networks show 
greater differences between configurations than do heavily loaded net- 
works. 
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4.3 Overload Performance 


Fig. 8 shows the over-all average blocking probability, B, for four 
different full-load network configurations (two-level hierarchy, three- 
level hierarchy, symmetrical, and gateway), and for the load changes 
discussed in Section 3.5 above. The ‘“‘base” load under the “BH Runs” 
heading represents the same average load as the ‘“‘engineered”’ point. It 
is a shorter run, however, and any difference in blocking between the 
two points is due to statistical fluctuations. Only the points on the charts 
are meaningful, but lines have been drawn connecting them for visual 
clarity. Fig. 9 is a similar chart, showing the dispersion factor, D. Figs. 
10 and 11 show the same factors for the reduced-load networks, where 
only the two-level hierarchy and symmetrical networks were examined. 

Although there are apparently some differences in performance be- 
tween the various networks under various load change conditions, it is 
clear from Figs. 8 and 9 that there is no single superior network configu- 
ration in terms of traffic capacity and performance under shifting loads 
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Fig. 8 — Over-all average blocking, B: comparison of network configurations, 
full-load networks. 
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Fig. 9 — Dispersion of blocking, D: comparison of network configurations, 
full-load networks. 


at full-load levels. Some small systematic differences are present, such 
as the fact that the two-level hierarchy appears to give slightly lower 
blocking than all other networks at all points except the Oakland varia- 
tions case, where the gateway shows up best. This can, however, be a 
result of the initial engineered blocking level, which is slightly lower for 
the two-level hierarchy than for the three-level hierarchy or the gateway. 
This initial point does not so much denote a difference in performance 
under changed loads as it does the slight inaccuracies in engineering 
level, which are then reflected at every point on the chart. Although the 
simulation runs which produced these measurements used the identical 
set of calls for all networks at each load, the standard deviation of the 
results due to the finiteness of the simulation run is of the order of magni- 
tude of the blocking probability at each point, and firm conclusions can 
be drawn only if a distinct superiority of one configuration over another 
manifests itself at almost all of the points considered. There are some 
such uniform results, but the differences are quite small, and may be 
offset by the differences in cost discussed above. 
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Figs. 10 and 11, on the other hand, show a small advantage for re- 
duced-load hierarchical networks under all changed load conditions. In 
this case there is no initial error, and all evidence indicates that the 
hierarchy is slightly superior. It must be remembered, however, that the 
hierarchy costs somewhat more in this case, and this sensitivity to over- 
loads may simply be the penalty paid for a more economical network at 
engineered loads. 

The conclusion which must be reached from these results is that, for 
large networks with fairly high traffic densities, the performance of vari- 
ous alternate routing configurations in terms of traffic capacity under 
changing load conditions is quite similar. The reason for this is probably 
that the very density of traffic in these networks causes many of the 
trunk groups to be quite efficient, and the great bulk of the traffic is 
carried on the direct routes. Differences in alternate routing configura- 
tion, therefore, affect only a small proportion of the total traffic, with a 
correspondingly small effect on the network performance. In more lightly 
loaded networks, as has been observed, the differences are greater as 
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Fig. 10 — Over-all average blocking, B: comparison of network configurations, 
reduced-load networks. 
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Fig. 11 — Dispersion of blocking, D: comparison of network configurations, 
reduced-load networks. 


more of the traffic is alternate routed. Even in these cases, however, the 
differences are not large, and the comparison made here between sym- 
metrical and hierarchical networks shows the slight superiority of one 
in cost to be offset by better performance of the other under shifting 
loads. 


Vv. ANALYSIS OF OPERATING AND CONTROL PROCEDURES 


5.1 Full versus Limited Routing 


As discussed earlier, symmetrical networks were operated in two ways. 
In the first of these, called ‘limited routing,” a maximum of three or 
four links per call was allowed, depending upon the connectivity avail- 
able to the traffic parcel. In the second, called ‘full routing,” five links 
per call were allowed for all calls. Fig. 12 shows a comparison of the 
over-all average blocking for these two cases. It is clear from this figure 
that operation with limited routing is superior in traffic handling capac- 
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ity. Although the differences at any point are still small, and the statisti- 
cal variability of the results large, the fact that there is an advantage for 
the limited routing case for every point tested indicates that this is a 
real effect, and not merely a result of chance observation. Furthermore, 
since these two curves represent the same network in terms of trunk 
layout, there is no possibility of complicating or compensating factors 
due to cost differences or engineering errors. In fact, the difference in 
blocking probability under engineered loads in this case does not repre- 
sent an engineering error, but instead an additional verification of the 
fact that operation with limited routing is superior. This result is further 
evidence of the fact that excessive alternate routing can cause service 
deterioration, even under light load conditions. (The routing used in 
the symmetrical networks discussed earlier was limited routing, chosen 
because it gave superior performance.) 

The symmetrical network whose performance is plotted in Fig. 12 
and in subsequent graphs is clearly not identical to that discussed previ- 
ously, since the blocking probabilities at all points are somewhat higher. 
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Fig. 12 — Over-all average blocking, B: full vs limited routing; full-load, high- 
blocking, symmetrical network. 
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Fig. 13 — Over-all average blocking, B: full-load symmetrical networks, effect 
of crankback. 
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Fig. 14 — Dispersion of blocking, D: full-load symmetrical networks, effect 
of crankback. 
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This network, however, is the one which originally resulted from the 
engineering program, and it will be used for all studies concerned with 
differences in operating method using the same network. The earlier 
comparisons between network configurations on a basis of both cost and 
performance required that the blocking probability be approximately 
equal at engineered loads, and trial and error modifications were made 
to the symmetrical network to bring its blocking probability down to 
the proper level. The comparisons between different modes of operation 
of the same network should not.be significantly affected by exact level of 
blocking at engineered loads and are expected to be valid for all networks 
of approximately the traffic densities considered. In the investigation of 
the effectiveness of crankback, however, symmetrical networks with 
both low and high over-all blocking probabilities were examined. 


5.2 Crankback 


Comparisons of networks operating with and without crankback 
were made for hierarchical and symmetrical networks using both the 
full traffic and reduced traffic models. In the hierarchical networks, no 
significant difference in behavior could be detected between the networks 
operated with and without crankback. This is because the structure of 
the hierarchical network is such that most of the blocking occurs on final 
route links, which are impossible to avoid even with the crankback 
option. For example, in Fig. 3(a) if a call from 1 to 2 is blocked at node 
4, it may, with crankback, back up to node 1 and attempt to reach node 
2. Even if this is possible, however, there is still a large probability of 
being blocked on route 3-2, and hence rearriving at 4 at some later time. 
In addition, those calls which do get through using the crankback option 
tend to use relatively long routes, causing later calls between other points 
to be blocked. 

The over-all network blocking and dispersion of blocking for sym- 
metrical networks with and without crankback are shown in Figs. 13 and 
14 for full-load networks and in Figs. 15 and 16 for reduced-load net- 
works. Figs. 13 and 14 have curves for both the symmetrical network as 
originally engineered (hi-block) and the corrected symmetrical network 
which was used for comparison with the hierarchy (lo-block). This was 
done so that any differences introduced by the general level of blocking 
would be apparent. The curves indicate that the level of blocking has, 
at most, marginal significance at these loads, and that crankback de- 
grades the network performance at all but the lowest blocking levels, 
when it has virtually no effect. The networks here are operated with 
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limited routing, but a similar test with full routing yields results which 
are substantially identical to those shown. 

Figs. 15 and 16 show that crankback does offer a small advantage for 
less heavily loaded networks, although this advantage tends to disappear 
as the load increases, regardless of its distribution. 

These results indicate that for large networks, operation with crank- 
back at best offers a slight improvement in service when the service is 
good, and makes matters worse when the situation begins to deteriorate. 
An examination of the trunk occupancies and number of links per call 
shows that operation with crankback generally causes a larger number 
of links per call to be used on the average, with a higher over-all trunk 
occupancy. In effect, it therefore increases the amount of alternate 
routing allowed, and not always in the best way, so that degradation 
under overloads is a certainty. It therefore must be recommended that 
this device not be incorporated into large switching networks unless 
survivability, improved reliability, or other factors dictate it. If it is 
incorporated into a network for reliability or other purposes, means 
should be made available to disable it under overloads. 
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Fig. 15 — Over-all average blocking, B: reduced-load symmetrical network, 
effect of crankback. 
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Fig. 16 — Dispersion of blocking, D: reduced-load symmetrical network, 
effect of crankback. 


5.3 Trunk Reservation for First-Routed Traffic 


Figs. 17 through 22 show the effect of trunk reservation for first-routed 
traffic on the blocking and dispersion of full- and reduced-load symmetri- 
cal and hierarchical networks. This measure, which reduces the amount 
of alternate routing on a selective basis, provides a uniform improvement 
in performance for all networks shown, although the improvement is 
more marked in the case of full-load than in reduced-load networks. The 
two-level hierarchies were not noticeably affected by the introduction 
of this measure. 


In general, one trunk was reserved in each trunk group in the 
network, although two trunks were reserved on every group in some 
cases. It was generally found that reserving more trunks than 
noted in the charts had little additional effect upon the network per- 
formance. Figs. 17 and 18 show the effect of trunk reservation on sym- 
metrical full traffic networks. It is interesting to note that the network 
with full routing has almost identical performance to the network with 
limited routing when trunk reservation is used. This is not illogical, 
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Fig. 17 — Over-all average blocking, B: full-load, high-blocking symmetrical 
network, effect of trunk reservation. 


since trunk reservation has a gross effect similar to that introduced 
by limiting the number of links per call. 

Figs. 19 and 20 show the effect of trunk reservation on a three-level 
hierarchical network, and here we observe an improvement similar to 
that seen in the examination of symmetrical networks. 

Figs. 21 and 22 show the blocking and dispersion for the reduced- 
traffic symmetrical network, in which the effect is similar but of lesser 
magnitude than that observed in the full-load networks. 

It is quite likely that a selective application of trunk reservation to 
those groups which are large and have a large proportion of alternate 
routed traffic would be more effective than the across the board applica- 
tion used here. However, this study suffices to show that there is an 
advantage in the traffic handling capability of a network so equipped, 
and more detailed analysis will be required to determine the best number 
of trunks to be reserved in any given case. 

Trunk reservation has essentially the opposite effect on the network 
as crankback; it reduces the amount of alternate routing during periods 


ROUTING AND CONTROL 2671. 


of momentary congestion, preventing calls from being completed using 
circuitous routes at such times. Subsequent calls are then not affected 
and the over-all network performance is improved. 

One test was made using both trunk reservation and crankback, but 
the effect of trunk reservation appeared to dominate, and no difference 
was observed whether crankback was or was not used. 


VI. CONCLUSIONS 


The first and most obvious conclusion to be drawn from the preceding 
results is that for networks with a high traffic density the selection of 
routing doctrine and control philosophy does not have any great effect 
upon the traffic handling capability of the trunking network. This fact 
is apparently due to the substantial trunk group size generally encoun- 
tered in such networks, with the basic group efficiency sufficiently large 
to obviate any spectacular improvements due to clever routing or control 
schemes. Of course, these comments apply only to reasonable alterna- 
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Fig. 18 — Dispersion of blocking, D: full-load, high-blocking symmetrical 
network, effect of trunk reservation. 
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Fig. 19 — Over-all average blocking, B: three-level hierarchical full-load net- 
work, effect of trunk reservation. 


tives, such as those examined here. It is possible to develop a routing 
plan which would encourage circuitous routing at the expense of direct. 
Such a scheme would almost certainly show significantly poorer behavior 
than any of the networks investigated. 

Planning for future networks should then be initially concerned with 
other factors, such as economics, survivability, flexibility and so forth, 
with a precise evaluation of traffic capacity to be determined after the 
fundamental design considerations are well formulated. 

Having once accepted the basic idea that all differences are small in 
magnitude, we can nevertheless observe their direction, and, in the 
event that there are no other significant factors, decisions can be made 
on the basis of such small differences. A saving of one per cent in the toll 
trunk plant in the U.S.A. alone, for example, would amount to many 
millions of dollars, which is not insignificant in magnitude, even though 
it is a small fraction of the total network cost. 

In the comparison of network configurations, the symmetrical net- 
works have some cost advantages, particularly at lower load levels. This 
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is to some extent offset by a tendency to deteriorate under overload 
slightly more rapidly than hierarchical or gateway networks. Further- 
more, there is likely to be a not insignificant additional cost connected 
with the operation and control of such networks, and they are difficult 
to engineer and administer. They do have the advantage of improved 
survivability, however, since there is not so much concentration of facili- 
ties at regional switching centers. 

The gateway network behaves well under overloads, but requires too 
high an initial cost to warrant its use with existing technology. If tech- 
nological advances radically change the patterns of costs for such a net- 
work, then the gateway may be a suitable selection. The survivability 
aspects of these networks are particularly important, since sections of 
the network can be isolated by the destruction of a few critical points. 

The hierarchical networks, which were the first alternate routing net- 
works to be put into service, show a competitive initial cost and a reason- 
able reaction to shifting loads of all sorts. They are simple to engineer 
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Fig. 20 — Dispersion of blocking, D: three-level hierarchical full-load network, 
effect of trunk reservation. 
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Fig. 21 — Over-all average blocking, B: reduced-load symmetrical network, 
effect of trunk reservation. 


and administer, and the logic associated with switching and routing 
control is relatively uncomplicated and economical. They pose an obvious 
survivability problem, since some traffic parcels have access to only a 
single route. This situation can be largely alleviated by dispersion of 
routes and liberal provision of high-usage groups. 

In short, if a high-density communications network is desired, and 
concentration of traffic along backbone routes is allowable, then a hier- 
archical network is likely to be the best choice of network structure. As 
the traffic density declines, the symmetrical networks begin to show to 
advantage, and they are indispensable in some form if the survivability 
requirement rules out hierarchies. Symmetrical networks should, how- 
ever, be implemented only in conjunction with an operating technique 
such as trunk reservation to maintain overload capability. 

The investigations of control measures demonstrate conclusively that 
crankback is ineffective or harmful in all networks except perhaps those 
with extremely light traffic densities. It offers at most a small gain at 
engineered loads, and aggravates undesirable overload effects. There 
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would therefore appear to be no reason for providing it other than the 
nontraffic one of improving the ability of a call to avoid an equipment 
malfunction. If it is used for this purpose it should be disabled under 
overload, when it shows the greatest traffic disadvantage. 

Trunk reservation, on the other hand, almost always improves the 
traffic carrying capacity of networks, and is never harmful. It is an inex- 
pensive measure to implement which is unquestionably worth using, 
and further studies of the strategy and extent of its use should be under- 
taken. 

In sum, the basic factors relevant to the design of communications 
networks are: 

(1) If there is a high density of traffic, and traffic concentration on 
backbone routes is allowed, then a hierarchical configuration probably 
should be selected, with the number of levels dependent upon the par- 
ticular situation. 

(2) If the traffic density is lower and/or the hierarchy is unacceptable 
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Fig. 22 — Dispersion of blocking, D: reduced-load symmetrical network, 
effect of trunk reservation. 
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for survivability reasons, then a symmetrical network may be more 
economical and can perform well if properly controlled. 

(8) Crankback should not be used, except possibly as a means of 
alleviating the effects of equipment troubles. If used, its traffic disad- 
vantages under overloads should be taken into account. 

(4) Trunk reservation should be widely employed, since it is simple 
to implement and has noticeable traffic advantages under all load condi- 
tions with almost any network configuration. 

Although these guidelines are, of course, qualitative in nature, this is 
necessary because of the large number of variables which exist in an 
actual network. Variations in traffic levels between and within networks, 
geographical distributions of switching offices and densities of traffic, 
equipment limitations and differing primary functions all lead to different 
constraints and weightings of various factors. It is the purpose of these 
studies to provide guides for the design of communications networks, 
with final choices dependent upon specific factors. 
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An Experimental Study of Near-Field 


Cassegrainian Antennas* 


By D. C. HOGG and R. A. SEMPLAK 
(Manuscript received May 30, 1964) 


The near-field Cassegrainian antenna is a double-reflector system that 
employs, in its sumplest form, confocal paraboloids. Unlike the standard 
Cassegrain which employs a hyperboloidal subreflector illuminated by a 
spherical wave, the near-field device is fed by a uniform phase front. Hxpert- 
mental data on notse performance, gain, and radiation patterns have been 
obtained at a frequency of 6 gc using two 16-foot paraboloids (focal length- 
to-diameter ratios of 0.875 and 0.25) in both standard and near-field con- 
Jigurations. 

Using the shallow antenna, zenith noise temperatures of 10°K and 
6°K were obtained for the standard and near-field systems, respectively; 
at an elevation angle of 10° the antenna temperatures were 50°K and 20°K. 
Using the deep secondary reflector, zenith noise temperatures of 4°K were 
obtained for both configurations; at 10° above the horizon, however, the 
standard Cassegrain has an antenna temperature of 30°K and the near- 
jield device 18°K. In all cases, the antenna efficiencies are not far above 50 
per cent. Discussion of notse produced by various methods of mounting 
subreflectors is included. Since noise produced by transmission lines and 
antenna environment ts closely related to these experiments, it 1s discussed 
in detail in appendices. 


I. INTRODUCTION 


Large microwave antennas of high efficiency and low noise are de- 
sirable in radio astronomy, in tracking of space probes and in satellite 
communications. In all of these cases, convenient access to the asso- 
ciated electronic equipment is also a desirable feature. The horn re- 
flector! is an antenna which provides this access and also admirably 
satisfies the electrical requirements. Nevertheless, it is of interest to 


* Part of this material was presented to the URST in Washington, D.C. (May, 
1962). 
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examine other types of microwave antennas of more favorable ratio of 
geometrical aperture to total size with a view to improvement of their 
electrical performance toward that of the horn reflector. 

The purpose of this study is twofold: to evaluate the near-field Casse- 
grain as a microwave antenna, and to compare its noise performance 
with that of other antennas. Actually, two 16-foot diameter paraboloids 
have been tested, one with an f/D ratio of 0.375 and the other of 0.25. 
Measurements of antenna noise temperature, gain, and radiation pat- 
terns were made at a frequency of 6 ge using various feeding arrange- 
ments on both of these main reflectors. 

Most paraboloids have relatively low aperture efficiencies and exhibit 
poor noise performance. For example, paraboloids fed by a horn at the 
focal point typically have intrinsic (back lobe) noise temperatures of 
20 or 30 degrees Kelvin,?*+ whereas the equivalent noise for the horn 
reflector is about 2°K.5:5.7 This noise is due to thermal radiation from 
the environment of the antenna (mainly the ground) into the wide- 
angle or back lobes of the antenna; in what follows, it is designated by 
Pe 

Paraboloids fed by a source at the focus suffer from another deficiency: 
either the first circuit of the receiver must be mounted at the focal point 
(which is inconvenient), or a rather long transmission line (which re- 
sults in a prohibitive increase in noise) must be provided. This unde- 
sirable feature is overcome by use of the Cassegrainian configuration’ :? 
which, in the usual arrangement, has a point source feed at the apex of 
the main (secondary) reflector and a hyperboloidal (primary) subre- 
flector near the focal planc. In this case, the receiving equipment may be 
situated at the apex of the secondary reflector, free space serving as the 
transmission medium to the subreflector. Often the location of equip- 
ment near the apex is restrictive; depending on the arrangement, it 
may or may not move with the main reflector. This type of feed is re- 
ferred to here as the “standard” Cassegrain. 

The near-field Cassegrain combines some of the useful properties of 
the horn reflector with those of the standard Cassegrain. Rather than a 
point-source feed at the apex of the main reflector, a plane-wave feed 
of the same dimension as the subreflector is used.* Of course, the sub- 
reflector blocks the field of the main aperture just as in the case of the 
standard Cassegrain configuration. The plane-wave feed used for the 
measurements to be discussed was a small horn-reflector antenna. This 
arrangement allows the electronic equipment to remain stationary while 

* Experiments on an antenna of this type were described recently by Profera 


et al.!° Some generalized antenna systems based on this concept are discussed by 
S. P. Morgan." 
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the elevation angle of the antenna is changed, in much the same manner 
as with the horn-reflector antenna. 

It should be mentioned that the near-field feeding system is suited 
only to antennas that are very large compared to the wavelength. In 
the model that has been tested here, where the antenna diameter is less 
than 100A, this criterion is just met. However, it appears that the feed 
system is broadband, embracing all wavelengths shorter than that 
satisfying the criterion, and in this sense the near-field antenna is some- 
what similar to the horn reflector. Methods for mode scanning” a horn- 
reflector antenna are equally applicable to a near-field Cassegrain. 

In Section II, the geometry of the near-field Cassegrainian antenna is 
discussed; the fields produced by the near-field feed are also given 
there. Section III describes the equipment, siting, and the methods used 
for measurement of antenna noise temperature and gain. Sections IV and 
V contain the noise and gain measurements on the shallow and deep 
sixteen-foot paraboloids using various types of feeds; the effect of subre- 
flector mounting structures on noise performance is also given in those 
sections. Measurement of noise due to loss in transmission lines is dealt 
with in Appendix A. In Appendix B, the back-lobe noise temperature 
(T,) for an antenna in a given environment is discussed, and in Appendix 
C, a quality factor which governs the signal-to-noise ratio in antennas 
is proposed. 


Il. THE NEAR-FIELD CASSEGRAINIAN ANTENNA 


2.1 Comparison of Standard and Near-Field Cassegrainian Antennas 


The standard and near-field Cassegrainian antennas are compared 
in the idealized sketches of Fig. 1. An extensive analysis of the standard 
Cassegrain antenna has been given® and it will not be discussed further 
here; however, it should be noted that radiation from the point-source 
feed tends to spill over the rim of the hyperboloid. It has been demon- 
strated recently that suitable beam shaping of the source pattern can 
reduce this spill-over. The receiving equipment is stationary as the 
antenna changes elevation, a right-angle circular waveguide bend and 
rotating joint being provided in the transmission line (see Fig. la). A 
simple right-angle bend would not be used in systems employing circular 
polarization since, due to unequal phase velocities of orthogonal com- 
ponents, the circularity would be degraded. A simple bend was used in 
the measurements to be discussed since circular polarization was not 
involved. 

The near-field Cassegrain, shown in Fig. 1(b), has a horn-reflector 
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Fig. 1 — Idealized standard and near-field Cassegrainian antennas. 


feed with an aperture of about the same diameter as that of the sub- 
reflector. To a geometrical optics approximation, the near field of this 
feed is collimated and of uniform phase. 


2.2 Geometry of Near-Field Cassegrain 


A simple derivation shows that the surface of the subreflector in the 
near-field configuration should be paraboloidal. Assume that the sur- 
face of the subreflector (see Fig. 2) is paraboloidal; it will then be suffi- 
cient to show that the path length of any ray from the plane wave in 
the feed aperture HF to a reference plane in the secondary aperture is 
constant. Consider the ray of path length AB + BC + CD. Equating 
this path to the length of the axial ray, one has 


AB + BC + CD = 2f —fi) +f = 3f — 2h (1) 


where f, and f are the focal lengths of the primary and | secondary re- 
flectors.” From Fig. 2, the line segments AB, BC, and CD are equal to 
f — 2,7 — 11, and 7 cos @ respectively. Equation (1) then becomes 


f-atnm—n+trmcos @ = 3f — 2f (2) 


* Primary and secondary are used to designate the sub- and main reflectors 
respectively because the radiation patterns of the feed and main reflector are 
usually referred to as primary and secondary patterns. 
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where 2) = 7 cos 6, and from the equations of the paraboloids 


_ af _ 2f 
eee | + cos 0 and: 3S 1 + cos 6° 
Making these substitutions, (2) becomes 
oe 2f,(1 + cos 8) ue 2f(1 + cos 6) = 3f — Of, 


1 + cos 6 1 + cos 6 


which proves the equality. 

As in the case of the standard Cassegrain, the degree of illumination 
on the surface of the secondary reflector of a near-field Cassegrain can 
be varied by using subreflectors of various focal lengths. Optimum 
illumination, as determined by geometrical optics, is achieved by using 
a subreflector of f/D ratio identical to that of the secondary reflector. 

Fig. 3 is an idealized sketch of the near field along the axis of the 
source aperture, the relative positions of subreflectors used in the 
experiments being indicated by arrows. Note that the subreflectors are 
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Fig. 2— Geometry of near-field Cassegrainian antenna. 
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Fig. 3 — Idealized near-field along the axis of the feed. 


well out in the near-field. As mentioned in the introduction, the antenna 
size and wavelength used for these tests are far from optimum for the 
near-field type of feed. Preferably one would choose dimensions as large 
as possible with respect to wavelength (a high-gain antenna) in which 
case geometrical optics would hold more rigorously; an immediate conse- 
quence of this is that the subreflector positions (see Fig. 3) would be 
located where collimation and phase uniformity of the near-field are 
greatly improved. 


2.3 The Conical Horn-Reflector Feed 


A horn-reflector antenna" was used as the near-field feed for both the 
shallow and deep dishes. Theoretical and experimental studies of the 
far-field characteristics of this antenna have been published;? here, 
discussion is confined to its near-field characteristics. 

The first 16-foot diameter secondary reflector used in the near-field 
Cassegrainian configuration had a focal length of 6 feet; therefore 
measurements of amplitude and phase of the field were made six feet 
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in front of the aperture of the horn reflector, where the subreflector 
was to be mounted. The measurements were made in an anechoic cham- 
ber using a dipole probe. The data are plotted in Fig. 4 along with 
theoretical curves; these compare favorably in their general trend but 
not in detail. One notes that the phase departs from uniformity by 
about +10° in some cases and that it is unsymmetrical with respect to 
the axis. 


III. EQUIPMENT AND METHODS OF MEASUREMENT 


3.1 Equipment 


Fig. 5 shows a complete antenna mounted on a motorized turntable 
carriage for azimuthal rotation. The cab to the left of the antenna is 
shielded; it houses the necessary equipment for measuring noise tem- 
perature, gain, and radiation patterns. The double A frame and cradle 
structure on which the secondary reflector is mounted is shown more 
clearly in Fig. 9 (p. 2691); it is a strong structural unit, no demand being 
made of the paraboloid for supplying rigidity. Elevation steering is 
provided by rotation of the cradle on bearings in the A frames. The 
transmission line, a circular waveguide of 2.8-inch diameter, is fed to 
the receiver in the cab through the bearing via a rotating joint. 

The antenna is sited in a relatively flat, clear area; however, the site 
is ringed with trees which limit the horizon to an average elevation angle 
of 1.5 degrees. 

The 6-ge maser receiver used for antenna noise temperature measure- 
ments has been discussed previously.*” 


3.2 Method of Noise Measurements 


The setup used for measuring noise performance is shown in Fig. 6. 
The noise temperature at the input to the converter with the noise lamp 
off is given by 


Pie = Gm( Ta ss Tn) (3) 


where G,, 7’ are the maser gain and noise temperature, and 7, = 
T, + T: + T, is the antenna temperature. 7’, is the sky temperature 
observed by the main beam. 7 is the noise temperature of the trans- 
mission line associated with the antenna.” 7’, is the noise contribution 
from the earth and sky through the wide angle side and back lobes of 


* Here we neglect the actual attenuation due to transmission line loss; it is 
quite small. 
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Fig. 4 — Near-field patterns of horn reflector (Z = 6 ft.). 
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Fig. 5 — A 16-foot diameter paraboloid (f/D = 0.375) with near-field Casse- 
grainian feed. 
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Fig. 6 — Noise contributors in the receiving system. 
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the radiation pattern. With the noise lamp fired, the noise input to the 
converter becomes 


Tic = GmA(Ta + Tn + Tr) + (1 — A)To (4) 


where 7’, is the additional noise introduced by the calibrated noise lamp, 
T, the ambient temperature and A the reciprocal of the additional loss 
introduced by a precision attenuator to equalize (8) and (4). 

Solving (3) and (4) for 7. , 


AT. To 
LOA G. 
Since the terms on the right of (5) are determined by independent 


measurement, the noise temperature due to the back lobes is obtained 
from 


de — == j eee (5) 


m%=T.-T,-—7T1 (6) 


provided 7, and 7; are known. 

The sky temperature, 7; , for an atmosphere of given humidity is 
known from experience.* The transmission line contribution, 71, is 
measured independently as discussed in Appendix A. 


3.3 Method of Gain and Radiation Pattern Measurement 


Gain measurements were made by comparing the power received by 
the antenna with that of a standard horn. A height run was made with 
this horn over the vertical extent of the main paraboloid for each gain 
measurement, the average of these data being taken as the reference 
value. Using the same equipment, azimuthal radiation patterns were 
obtained for the two principal polarizations. 


IV. MEASUREMENTS ON THE SHALLOW PARABOLOID (f/D = 0.375) 


4.1 Noise Measurements Using Various Feeds 


The 16-foot diameter spun-aluminum shallow paraboloid (f/D = 
0.375) was first fed in a conventional manner using a cylindrical wave- 
guide horn supported at the focal point by fiber glass struts, the feed 
waveguide running out from the apex. The radiation pattern of this 


* In assigning values to 7; , the absolute water vapor density at the ground is 
determined from humidity and temperature measurements at the receiving site. 
Based on the particular value of water vapor density obtained, theoretical sky 
temperatures which have been verified previously’ are calculated. 
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feed tapers to about —10 db at the rim of the paraboloid; this, in addi- 
tion to an inverse distance effect of 3 db, results in a net illumination 
taper of 13 db. 

Using the method for measuring noise discussed in Section 3.2, the 
two measurements on the curve of Fig. 7 labeled A were obtained* 
for zenith angles (@) of zero and 45°. In spite of the relatively strong 
taper, this feed produces a noise temperature of 26°KK at both angles. By 
removing the fiber glass mounting struts and supporting the feed with 
fine guy ropes, the point labeled B was obtained, 7, = 13.5°K. Com- 
parison of the zenith point on curve A with point B shows that the fiber 
glass struts contribute 12.5° of noise. Strut noise can be produced both 
by reflection of noise radiated from the ground into the antenna and by 
loss in the material comprising the strut; this point is elaborated upon 
later. 

The second feeding arrangement measured on this paraboloid was a 
standard Cassegrain consisting of a precision hyperboloid (whose diam- 
eter could be changed from 30 to 24 inches by removal of an outer ring) 
fed by a horn of 3.5 diameter located at the apex of the main dish. The 
transmission line from horn to maser was about six feet of oversized 
circular guide including a right-angle bend. 

With the hyperboloid mounted on fiber glass struts, noise measure- 
ments (at the zenith) produce 7; = 24.5°, as shown by point C in Fig. 7. 
The fiber glass struts were then covered with aluminum foil, essentially 
converting them to metal struts of the same geometry, and another 
measurement made, as indicated by the zenith value on curve D. At 
zenith the noise is now 7; = 11°K. Comparing this value with the 24.5° 
obtained using fiber glass struts, one sees that a decrease in noise of 
13.5° has been effected. This result shows that most of the noise (at 
least for zenith orientation of the beam) is produced by loss in the di- 
electric; this conclusion seems to be valid because the decrease of 13.5° 
compares very favorably with the 12.5° decrease in 7, obtained when 
the fiber glass struts were removed during the test using the waveguide 
feed. Measurements were also made versus zenith angle, as shown by 
the remainder of curve D in Fig. 7. As the horizon is approached, 7, 
reaches values of the order 50°K. This noise is produced by spill-over 
beyond the rim of the hyperboloid and by reflection of noise from the 
earth by the sizable struts. 

The last feeding arrangement to be discussed is the near-field Casse- 
grainian configuration. Fig. 5 shows a front view; mounted near the 


* It may be well to mention here again that 7, in Fig. 7 is the intrinsic antenna 
noise; sky and waveguide noise, etc. are not included. 
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Fig. 7 — The 6-ge noise performance of 16-foot paraboloid (f/D = 0.375). 


focal plane is one of the several paraboloids used as primary reflectors. 
The mounting struts are of interest; they are metallic, light-weight, 
small, and extend to the rim of the secondary reflector, the latter being 
done so that the struts do not intercept direct radiation from the sub- 
reflector. 

Noise data obtained for this feed using a primary reflector of 12-inch 
focal length are shown in Fig. 7 as curve E. T; at the zenith for this 
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arrangement is less than 8°K. Equally important, however, 7; is less 
than 20°K in the region of 6 = 80°, which is approximately the acquisi- 
tion angle in a satellite communication system. At this angle the sky 
noise’ is relatively high compared to the zenith value and it is desirable 
to have T; as low as possible. 

By using a subreflector of longer focal length (14.5 inches), much 
less spill-over of the secondary reflector occurs and a zenith temperature 
of 4°IX was achieved. For all elevation angles with this subreflector, 
T, was of the order of one-half that obtained with the subreflector of 
12-inch focal length; of course, the secondary area illuminated is rather 
small and the gain is reduced by about 1.5 db; thus use of such sub- 
reflectors is of questionable value. A discussion of the signal-to-noise 
ratio in antennas is given in Appendix C. 

To pursue the strut-noise effect further, a pressurized Mylar sheath 
support (similar to that shown in Fig. 9) with a wall thickness of 1.5 
mils and inflated with nitrogen to a pressure of 0.15 psi was devised; it 
provides a remarkably rigid support. Nichrome guy wires from the 
subreflector to the rim of the secondary reflector are used for centering. 
Noise data for this arrangement (curve F in Fig. 7) show that there is 
an Improvement of about 2°K (compared with curve E) for all eleva- 
tion angles. 

Curves G and H are included in Fig. 7 to serve as reference data. 
Curve G was obtained using the 27-inch horn-reflector feed (itself a 
low-noise antenna of respectable size with a far-field beamwidth of about 
5°) mounted on the 16-foot paraboloid, struts and subreflector bemg 
removed. The arrangement amounts to a well shielded horn reflector 
with a large baffle (the secondary reflector); as indicated in curve G, 
Ty, is less than 1°K at zenith. The rather rapid increase to 7, = 9°K 
at @ = 80° is attributed in part to the limited horizon” mentioned in 
Section 3.1. Curve H was obtained using a five-foot horn reflector at 
the Crawford Hill site, which has a clear horizon.’ 


4.2 Radiation Pattern, Gain, and Impedance 


The idealized near field of the conical horn feed was shown in Fig. 1 
as a plane wave perfectly collimated in the direction of the primary 
reflector. In reality this is not the case; as discussed in connection with 
Fig. 4, the phase varies as much as +10° in places and is unsymmetrical 
with respect to the Z axis, as is the amplitude. From an academic point 

* The decrease in noise very near the horizon shown by the curves in Figs. 7 


and 10 is also attributed to the environment; this effect was not observed using the 
Crawford Hill site (Refs. 5 and 7). 
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of view, a plane-wave source with a symmetrical aperture field (such as 
a horn-lens antenna) would be preferable as the feed, since the unsym- 
metrical phase produced by the horn reflector produces unsymmetrical 
radiation patterns in the near field. Evaluation of the field distribution 
in the aperture of the main dish is also a near-field diffraction problem. 
Preliminary calculations, using idealized fields, indicate that this distri- 
bution is toroidal; the secondary patterns produced therefore have 
relatively high immediate side lobes, similar to those of a heavily blocked 
aperture. 

Fig. 8(a) shows the measured pattern for longitudinal (horizontal) 
polarization, using the subreflector of 12-inch focal length. Note that the 
side lobes are unsymmetrical and that the highest one is only some 13.5 
db down from the main lobe. In Fig. 8(b), for transverse (vertical) 
polarization, the immediate side lobes are more than 20 db down. In 
both cases the half-power beamwidth is about 0.7°. For comparison, 
a calculated curve for constant amplitude distribution and 2 per cent 
aperture blocking is also shown in Tig. 8. The gain is 48.0 db for trans- 
verse and 47.4 db for longitudinal polarization. The calculated full area 
gain of the 16-foot dish is 50.1 db at 6.3 kme; thus the average effective 
area is 2.4 db down or, in other words, the efficiency is 57.5 per cent. 

The SWR for the above configuration is 1.17, equivalent to a return 
loss of 22 db. A slight improvement in impedance is obtained by remov- 
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Fig. 8 — Radiation patterns of shallow paraboloid. 
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ing a circular area from the subreflector and illuminating the concave 
surface of the subreflector in the proper phase. This supplementary 
feeding arrangement (shown clearly in Fig. 5) resulted ina VSWR of 1.15 
or a return loss of 238 db. 





Fig. 9 — 16-foot paraboloid (f/D = 0.25) with near-field Cassegrainian feed. 
Note Mylar support for subreflector. 


V. MEASUREMENTS ON THE DEEP PARABOLOID (f/D = 0.25) 


5.1 Noise Measurements Using Various Feeds 


Performance was next examined using a deep paraboloid (f{/D = 0.25) 
as secondary reflector; Fig. 9 shows a front view. This paraboloid (diam- 
eter 16 feet) was machined from urethane foam, a reflective surface of 
zine being applied after machining.* Near the focal plane is one of the 


* Precision in reflecting surfaces is an important factor in determining the 
wide angle lobes!5 and therefore the noise performance of antennas. 
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paraboloids used as a primary reflector, in this case supported by a 
Mylar sheath. An alternative support used four half-inch struts extend- 
ing perpendicularly from the mounting ring surrounding the feed 
aperture to the subreflector. 

Curve A of Fig. 10 shows data obtained using a 30-inch diameter 
subreflector of focal length 7.5 inches supported by metal struts; as 
indicated, the zenith temperature is about 4°K. Next, a 24-inch diam- 
eter subreflector with a focal length of 6 inches was used, the f/D ratio 
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Fig. 10 — The 6-ge noise performance of 16-foot paraboloid (f/D = 0.25). 
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being the same as in case A. As indicated by curve B, this feed arrange- 
ment displays very good characteristics, also achieving a zenith tem- 
perature of 4°IK. Equally important, 7, remains less than 10°IK until a 
zenith angle of sixty-five degrees is reached. In the region of 6 = 80°, 
T, is less than 15°IKx. 

Even though the struts supporting the subreflector were of small size, 
it was of interest to see what the change in 7; would be if the struts 
were removed. Fig. 9 shows the nitrogen-filled Mylar sheath (previously 
discussed in Section 4.2) supporting the 24-inch diameter subreflector, 
and curve C in Fig. 10 shows the data obtained. There is no appreciable 
change in the zenith noise temperatures; however, there is improvement 
for angles approaching the horizon, indicating that the small struts were 
to some extent scattering the earth’s radiation into the antenna. 

A standard Cassegrain feed consisting of a precision 30-inch diameter 
hyperboloid machined from styrofoam and suitably surfaced with zine 
and a 3.5\ diameter horn located at the apex of the secondary reflector 
was tested for comparison with the near-field feed; the noise measure- 
ments are shown as curve D in Fig. 10. At zenith, the noise temperature 
is 4°K, but spill-over effects quickly become apparent when the zenith 
angle exceeds sixty degrees. At 6 = 80°, 7; has increased to about 30°K; 
this is attributed mainly to spill-over beyond the rim of the subreflector. 

Reference curves E and F are included in Fig. 10; similar data were 
discussed in connection with Fig. 7. 


5.2 Radiation Pattern, Gain and Impedance 


Tig. 11 shows the azimuth patterns for the deep dish using the 24- 
inch diameter paraboloid as primary reflector. In Fig. 11(a) (ongitudi- 
nal polarization) the immediate side lobes are unsymmetrical and the 
highest one is about 13.0 db down. Figure 11(b) is the secondary pattern 
for transverse polarization in the feed. In both cases, the 3-db beamwidth 
is about 0.7°. 

During the measurements, the well known problem of properly 
illuminating a deep paraboloid became apparent; however, it was 
readily determined that deep reflectors may be illuminated more easily 
by Cassegrainian techniques than by focal point feeds. For example, 
using the near-field feed, the illumination at the rim of a 30-inch sub- 
reflector is down about 13 db; including a 6 db inverse distance attenua- 
tion, the resulting taper across the secondary reflector is approximately 
20 db. The measured gain was 4 db down from full area. By reducing 
the diameter of the subreflector, the taper is also reduced, thereby in- 
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Fig. 11 — Radiation patterns of deep paraboloid. 


creasing the gain (in this process, noise performance is often sacrificed 
for gain). A 24-inch subreflector was found to be a suitable compromise, 
measured gain for both polarizations being 47.3 db (2.8 db down from 
area). Somewhat surprisingly, as indicated by curve B of Fig. 10, there 
is no significant deterioration in noise performance. 

Average SWR measurement for the configuration last mentioned is 
1.11, which is equivalent to a return loss of 25 db, an improvement of 
3 db over that of the shallow dish. 

The gain of the standard Cassegrainian configuration was 47.4 db 
and 47.7 db for the vertical and horizontal polarizations, the average 
value being 0.24 db higher than that of the near-field device. 


VI. DISCUSSION 


The measurements discussed here have been directed toward evalua- 
tion of the noise performance of several types of feeds for full paraboloidal 
reflectors. In particular, it is found that the near-field Cassegrainian 
feed, a device whose design is based on simple geometrical optics, 
performs exceptionally well, its low-noise characteristics being as good 
or better than those of the standard Cassegrainian feed over all angles of 
elevation. This result holds true for both shallow and deep secondary 
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reflectors. The efficiency of the near-field Cassegrain is about 55 per cent, 
similar to that obtained using more conventional feeds; the radiation 
patterns are unsymmetrical due to lack of symmetry in the phase of the 
primary field. 

Deterioration in noise performance due to the struts (or spars) used 
for supporting feed structures has been examined. Dielectric struts, 
such as those of fiber glass, have been found to introduce noise because 
of loss in the material. A pressurized membrane has been tested as a 
support for subreflectors; it appears satisfactory mechanically and 
minimizes degradation in electrical performance. 
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APPENDIX A 
Transmission Line Measurements 


A.1 Noise Measurements (Short Circuit) 


The noise that exists in a transmission line is caused by resistive losses 
in the line itself and by noise generators which may be at either or both 
ends of the line. Consider the infinite transmission line of Fig. 12(a), 
which for the moment is assumed to have a noise-free measuring device 
at terminal A. Let the thermodynamic temperature of the line be 77, 
and a the power absorption coefficient. Since the line is of infinite length, 
the noise temperature measured at A is simply 7, = 77. Divide the line 
into segments 1 and 2 at point /. The contribution to the noise at A 
by segment 2 is Te “’ (since the line is infinite); therefore that con- 
tributed by segment 1 is 7;(1 — e *’). A series expansion gives 


Ti1—1l+al— Ga E (2) as) 
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Fig. 12 — Noise measurements on shorted transmission line. 


which for small @ is approximately Tial. Naturally, if the total loss, al, 
is known, the noise produced is obtained immediately.* 

Fig. 12(b) shows a movable shorting piston, S, near the terminals A 
of the amplifier to be used for the noise measurements. The amplifier is 
not perfectly noise free, nor is it perfectly matched; therefore movement 
of the piston produces a cyclical variation in the noise at A. 

Let the effective temperature of the maser amplifier at terminals A 
in Fig. 12(b) be designated 7’, ; this represents the intrinsic noise, which 
amounts to about 3°K. 7’; represents the effective temperature of the 
transmission line and 7’, that of the shorting piston, S. The voltage 
reflection coefficient at A (the input mismatch of the maser) is p. The 


* This assumes that the thermodynamic temperature of the line is constant; 
if not, the effective noise temperature is given by 


U x 
T = i T(x)a(x) exp ©! a(x) is) dx 


where 7'(x) is the temperature distribution along the line. 
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sources that give rise to noise 7’, traveling to the left (i.e., into the am- 
plifier) at A are: 


Tm, — intrinsic amplifier noise; 

Tm. — amplifier noise traveling to the right, and reflected at S 
(uncorrelated with T'n,); 

T1, — line noise initially traveling to right and reflected by 8, also 
that reflected by p at A and again by §; 

T,, — line noise initially traveling to left, also that reflected by p 
and by S (uncorrelated with 7,); 

Ts —vshorting piston noise, traveling to left, also that reflected by p 
and by 8. 


Thus, to first order the noise entering the amplifier at A is 
T= Tn, + Ta, + Ti + Ti, + Ts. 
If the attenuating effects of line loss are neglected (since they are only 
of the order 0.1 db), the above sum to first order becomes 
T= Tmy + (Ting + 27. + Ts) (1 + 2’) 


i 
+ OCT vs + 27; + T's)p COS 26 


where 
27,=7T,4+T. 


8 being the propagation constant of the line. 

Measured data (noise versus short position) shown as crosses in Fig. 
12(c) were taken using a short circuit in round waveguide. Also shown 
(as a solid curve) are data calculated using the following constants in (7): 
Tm, = 3°, Tm, = 3°, Ti = 10°, T's = 2.5° (short circuit with a standing 
wave ratio of 250) and p = 0.075 (22-db return loss), the last two being 
measured values; these result in 


T = 28.6 + 3.8 cos 26l. (8) 


Now let an additional length of transmission line be added to / such 
that the total length is J, , and let the short S be moved to the end of 
this line. The data, shown as dots in Fig. 12(c), were obtained when 
approximately 6 feet of 2.8-inch diameter line* were added. The dashed 
curve is a plot of (7) with the following constants: 


Pn = Tm = 3°, Ti, = 14°, P= 25", p = 0.075, 
T, = 36.7 + 5.0 cos 26l. (9) 


* The line actually comprised 45 inches of straight guide and a right-angle bend 
of 15-inch radius used in the standard Cassegrain feed. 
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If (8) is subtracted from (9), one has 
(71, — T) (1 +p) + 4(T1, — Tr)p cos 261 
= 7.9 + 1.2 cos 21. 


From the first terms of both sides of (10), one obtains 2(7,, — T1) = 7.9, 
or an increased noise temperature 7), — JT; = 3.95° due to the addi- 
tional length of line. From the second terms one obtains 4(71, — T1)p = 
1.2 or T,, — T; = 4°. The accuracy of the latter value is very dependent 
upon an accurate value for p (measured as 0.075), whereas the first 
value, 3.95° (which is really the difference between the average values 
of the plots in Fig. 12(c), is good to order 1 + p’. The value 3.95°K 
is equivalent to a loss of 0.052 db in the additional length of line. 


(10) 


A.2 SWR Measurements (Short Circuit) 


Using the same shorting piston and transmission line as above but 
adding an additional 60-inch length of waveguide (diameter 2.8 inches), 
the measured VSWR was 85, which is equivalent to a loss of 0.0093 
db/foot. 

The loss per unit length derived from the noise measurement just 
discussed is 0.0092 db/foot, indicating close agreement between the two 
methods. 


APPENDIX B 


The Antenna Noise — T, 


Since typical antenna patterns have significant levels in the side and 
back lobes, it is necessary to consider the effects of noise due to thermal 
radiation from the environment into the antenna.” This effective noise 
temperature is designated by 7; . 

Consider first the ideal radiation pattern shown in Fig. 13(a): it 
has a very narrow beam of width a, the gain G being constant over the 
angle a; it has no back lobes. The antenna is assumed to be lossless and 
to be mounted height h above the ground. Beamed at various angles @ 
with respect to zenith, this antenna sees the true brightness tempera- 
ture, T(@), due to various noise sources. For 0 < @ < 2/2, T(6) = T'(6) 
is the sky temperature. For 7/2 < 6 < 7a, the brightness temperature 
is due to both sky and earth, as shown in Fig. 13(b), since the sky 
noise from angle + — @ is reflected at point P according to the reflection 


* This effect has been discussed recently in Ref. 16. 
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Fig. 13 — Ideal antenna and its environment. 


coefficient of the earth, r(6), at P, whereas the earth generates noise that 
enters the antenna directly dependent upon the coefficient a(6), which 
represents absorption at P. Noise due to loss in the atmosphere along 
path length 7 also contributes to the brightness temperature. Since 
r(0) + a(@) = 1, the brightness temperature for 7/2 < @ < wis 


T(6) = r(6)T.(m — 6) + TL — r(6))To + T2,(8 — 2/2) (11) 


where 7’), the temperature of the ground, is assumed to be 300°K. 

The term 7,(6 — 2/2) of (11) represents noise due to the path in 
the atmosphere between the antenna and the point P. Compared with 
other noise sources, it is found to be negligible, and therefore has been 
disregarded in what follows. 

The reflection coefficient r(@) is highly dependent upon the environ- 
ment and to some extent on polarization; it usually varies with time, 
being a function of the ground conditions over vegetated areas and the 
wave conditions over water. Using representative data at 10 cm wave- 
length for the reflection coefficient,” the sky temperature, and (11), one 
can estimate the brightness temperature distribution for all angles 6, as 
shown in Fig. 14. Curve B is for smooth sea water and curve A for a 
perfectly reflecting mirror (which images the sky noise), whereas the 
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Fig. 14 — Brightness temperature distributions for middle em-wave band. 


poorly reflecting ground environment, curve C, approximates a perfect 
absorber. 

An actual antenna has a finite radiation pattern G(6); assuming it 
to be symmetrical about the main axis of the antenna beam, the equa- 
tion relating antenna temperature to radiation pattern and brightness 
temperature is 


T, +7; = al G(0)T(6) sin 6 do (12) 


for the antenna beamed vertically. If the antenna is beamed at angle 
6’ with respect to the zenith, (12) becomes 


T.(6) + 7,(0’) = ah " G(6 — 6')T (6) sin 0 dd. (13) 


CASSEGRAINIAN ANTENNAS 2701 


As a simple application of (12), consider an isotropic antenna sur- 
rounded by a noise-free sky and a perfectly absorbing earth. In this 
case G(@) = land T(@) = Ty for 7/2 < 6 < a; thus 

T.+T=zf  Tosine do = <° = 150°K. 
2 w/2 2 

An idealized radiation pattern for a microwave antenna is shown in 

Tig. 15, where 


I 


G(6) Go, 0<6<a/2 
and 


G(6) 


Gs ) a/2 < 0 < Ty, 
G, being the average gain in the side and back lobes. Again assuming a 
noise-free sky and perfectly absorbing earth, 


ToGo 
2 


Le => 5 | GT sin 6 dé = = 150G,°K. 
aw/2 


Thus, for example, if G, = 0.1 (10 db below isotropic), 7,, = 15°K. 

Using the idealized antenna pattern of ig. 15 and the data of Fig. 14, 
let us now integrate numerically according to (12). The noise contribu- 
tion from the main beam (the so-called sky noise) is 


af[2 
T, = : / Go T,(6) sin 6 d@ ~ 2.5° 
0 


which is readily taken from Fig. 14. 








BA 65 


Fig. 15 — Idealized antenna pattern. 
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Using G, = 0.1, the contribution due to sky noise in the far side lobes 
(a/2 <0 < 2/2) is 


_& 


Le; 9 


r/2 
| 2.(0) sin 6 do = 0.7°. 
a/2 
From the region 7/2 < @ < x (ground, etc.), the contribution is 
Tr, =f 1(6) sine a 
2 w/2 


which amounts to 0.7° for the antenna above a perfect reflector, 7.6° 
above sea water and 15° above a perfectly absorbing earth. 

Thus for an antenna with far side and back lobes 10 db below an 
isotropic radiator, the total antenna noise due to atmosphere and en- 
vironment for zenith orientation of the beam is 


Te = Z, + fe + Ly 


which amount to 3.9° (perfect reflector) 
10.8° (calm sea water) 
18.2° (ground with vegetation which approximates 
a perfect absorber). 
T, = Ty, + T,, for the above conditions is 
1.4° (perfect reflector) 
8.3° (calm sea water) 
15.7° (ground with vegetation) 
obtained simply by subtracting the sky noise (2.5°K) from the previous 
numbers. 


APPENDIX C 


The Signal-To-Notse Ratio and Quality Factor of an Antenna 


For the idealized antenna pattern of Fig. 15, the received power at the 
terminals of the antenna oriented toward a white noise signal source is 


Ps = SAB = SG,(X°/4r)B 


where S is the incident signal flux, B the bandwidth, and A the effective 
area of the antenna. The total noise in the antenna is Py = kT.B, k 
being Boltzmann’s constant and JT, = T, + Ty + 77. The contribution 
T, is the sky noise in the main beam; it is essentially independent of 
the gain Go for high-gain antennas. 7; and YT, are the effective noise 
temperatures due to back lobes and line losses. The signal-to-noise ratio 
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for the antenna is therefore 


Ps SX @ _ SX Go (14) 
Py 4k T, 4ak(T. + Te. + T1)° 
This assumes that the noise figure of the receiving amplifier 1s negligible. 
Of course, the receiver noise and the antenna impedance must both be 
considered in calculating the system noise. 

Of the terms contributing to 7, in (14), 7’; is unavoidable and only 
T, and T; can be attributed to deficiencies in the antenna. We can 
define a quality factor for the antenna in the following way: set 7 equal 
to zero and multiply numerator and denominator of (14) by 7; then 


Pen: SX” GoTo _ Sn” (15) 
Py 4ckT) (TM +71) 4akT 


where Q = GoTo/(T> + T1) is the quality factor. Examples of typical 
values of Q are: 

(1) An isotropic antenna completely surrounded by a perfect ab- 
sorber at 7’) = 300° (7; = 0, no line losses), Q = 1. 

(2) An isotropic antenna surrounded by a perfectly absorbing earth 
and noise-free sky, (no line losses), Q = 2. 

(3) The antenna above ground as discussed in Appendix B with far 
side and back lobes 10 db below an isotropic radiator, (where 7; ~ 15°), 
Q = 20G. 

(4) The near-field Cassegrain as discussed in Section V 


(%=54X10, %M+7, = 4°+4°=89), Q@=2x 10° 





(5) A horn-reflector antenna with the same aperture area and trans- 
mission line loss as in (4) above 


(Qo=7X10, %14+7=424+1°= 5°), Q=42 x 10° 
Similarly, one can define a quality factor for the total receiving system 
as 
= GoT'o 
Qr > Tr fe Ly 


where 7’z represents all noise associated with the receiver proper, and 
T,, all noise associated with the antenna. 
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A Two-Gyro, Gravity-Gradient Satellite 
Attitude Control System 


By J. A. LEWIS and E. E. ZAJAC 
(Manuscript received April 24, 1964) 


This article gives the results of an analytical and numerical study of a 
two-gyro, gravity-oriented communications satellite. The principal purpose 
of the study was to uncover and solve the analytical problems arising in the 
design of passive gravity-gradient attitude control systems. Although the 
study was directed at satellite orientation, it 1s felt that many of the tech- 
niques developed have general use in the investigation of dynamical systems. 

We consider both small and large motions about the desired earth-pointing 
orientation. In the small-motion study, the goal 1s simultaneous optimization 
of the transient response and the forced response to perturbations caused by 
orbital eccentricity, magnetic torques, solar torques, thermal rod bending, and 
micrometeorite impact. In the large-motion study, we enumerate all possible 
equilibrium positions of the satellite and then consider initial despin after 
injection into orbit, inversion of the satellite from one stable equilibrium 
position to another by switching of gyro bias torques, and the decay of transt- 
ent motions resulting from large initial angular rates. 

As a specific numerical example, we have treated a 300-lb satellite in a 
6000-nm orbit, stabilized by a 60-ft extensible rod with a 20-lb tip mass, 
and by two single-degree-of-freedom gyros, each with an angular momentum 
of 10° cgs units. Without a detailed discussion of hardware, it is concluded 
that such a system, having a total weight of 50 to 75 pounds including power 
supply, will provide a settling time for small disturbances of less than one 
orbit and will hold the antenna pointing error within a few degrees. 
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I. INTRODUCTION 


It has been known for over two hundred years that the variation in the 
gravitational field over the length of an earth satellite generates torques 
which tend to keep the axis of minimum inertia of the satellite pointing 
toward the earth. In particular, this mechanism keeps one face of the 
moon earth-pointing. 

Such gravity-gradient orientation of communications satellites is very 
attractive because the simplicity of the effect leads to the possibility of 
simple attitude control and hence high reliability and long life. On the 
other hand, the tiny size of the gravity-gradient torques means formida- 
ble mechanization problems, and although Pierce suggested its use as 
early as 1955,! gravity-gradient stabilization has been widely held to be 
impractical. 

However, several recent analytical and hardware studies have resulted 
in proposals for practical, gravity-gradient controlled satellites. All the 
proposed schemes work on the same principle. Steady-state perturba- 
tions, due, for example, to magnetic and solar torques, are kept within 
tolerable limits by making the satellite inertia sufficiently large, usually 
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with some sort of extensible rod-tip mass combination. Damping of 
transient perturbations is provided by connecting the satellite through a 
dissipative joint to an ‘‘anchor,” that is, to some object that will allow 
energy dissipation by virtue of relative motion between itself and the 
satellite proper. The anchor may be one or more gyros, as in the schemes 
discussed by Ogletree, et al.,?3:4 by Burt,’ and by Scott;® a second rigid 
body, cither hinged to the satellite, as proposed by Kamm,’ by Paul, 
West, Yu, et al.;§-° or a second rigid body at the end of a compliant dumb- 
bell as discussed by Paul,!° by Newton," and by Fischell and Mobley ;? 
or a second, fluid body, as considered by Lewis.* 

In this article we examine a gravity-gradient system anchored by two 
gyros. A schematic of the system is shown in Fig. 1, where also is indi- 
cated the standard nomenclature for axes: the pitch axis is normal to the 
orbit plane, the yaw axis is along the local vertical, and the roll axis is 
along the orbital track. Each gyro rotor is contained in a gimbal can (not 
shown in the schematic), mounted on bearings, and immersed in a fluid 
bath. Thus, fluid shear produces the required energy dissipation. The 
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Fig. 1 — Schematic of two-gyro, roll-vee configuration. 
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gyros are single-axis gyros: that is, the spin vectors are constrained by 
the gimbal bearings to lie in a single plane within the satellite. In the 
position shown, this is the pitch-yaw plane. 

Because of the small physical dimensions of the gyro “anchor,” this 
system has the virtue that the dissipative joints can be sealed within 
the satellite; the joints are not exposed to the space environment when 
the satellite is operating. Also, the required inertia augmentation is 
particularly simple: only a single extended rod-tip mass. 

A simple explanation of how single-axis gyros damp out an arbitrary 
motion can be given in terms of the rate or torque-seeking property of 
gyros. A torque applied to a gyro will cause it to precess. By conservation 
of angular momentum, the precession will try to line up the gyro spin 
vector with the applied torque or angular rate vector. 

Bearing this in mind, assume that the satellite is in orbit in its earth- 
pointing orientation. It then is rotating at the rate of one revolution per 
orbit about the pitch axis. If the gyro gimbals were free, this pitch rate 
would cause the spin vectors to align themselves in the direction of the 
pitch axis. However, in order to obtain three-axis damping, the spin 
vectors are held in a vee position by equal and opposite constant torques 
(see Fig. 1), applied to the gimbals. 

Now, if the satellite is disturbed about the pitch axis, both gyros seek — 
the disturbance, resulting in a scissoring motion of the gimbals relative 
to the satellite, damping out the pitch disturbance. A yaw disturbance 
causes an in-phase motion of the gyros and again energy is dissipated. 
Since the gyro spin vectors are constrained to move in the pitch-yaw 
plane, they are constrained from moving toward a disturbance about the 
roll axis. However, the roll and yaw motions are coupled. Hence in this 
case the gyros again try to line up with the yaw axis. Thus three-axis 
damping is obtained. 

Our work continues a study carried out by the Instrumentation Lab- 
oratory?= of the Massachusetts Institute of Technology, under the 
sponsorship of Bell Telephone Laboratories, in which the particular two- 
gyro configuration studied here was shown to be the most promising of 
several possible gyro-anchored systems. Our primary objective, however, 
was not to design a specific attitude control system, which in any case 
would have to be integrated with the design of a specific satellite, but 
rather to develop general guiding principles and analytical and numerical 
techniques useful in such a design problem. Thus, we consider only the 
broad hardware questions that affect the analysis — for example, the 
design of extensible rods necessary to augment satellite inertias —- but 
we do not go into the detail of specific gyro hardware, as would be re- 
quired in a complete design. 
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The organization of the article is as follows. Section II, for the general 
reader, summarizes the results of our study in some detail in nonmathe- 
matical terms. Following some general remarks about inertia levels, ap- 
plicable to all gravity-gradient systems, we more fully describe the two- 
gyro system studied. We then summarize the system’s small-angle per- 
formance, stressing, in particular, the performance obtained when the 
inertia of the satellite is augmented by the erection of a single rod. Next 
we discuss the effects of the maim small-angle perturbations: orbital ec- 
centricity, magnetic torques, solar radiation pressure, micrometeorite 
bombardment and thermal rod bending. Finally, we consider large-angle 
motions, starting with initial despin upon orbital injection by a combina- 
tion of rod erection and uncaging of the gyros. In the discussions of large- 
angle motions, we indicate that there may exist equilibrium positions 
far removed from the desired, earth-pointing position; we also show how 
these may be avoided. 

Gravity-gradient systems are bistable: that is, associated with a sta- 
ble, earth-pointing orientation is a second, equally stable orientation 
obtained by a 180° rotation about the pitch axis. In the concluding sec- 
tion of Section II we describe how the satellite can be flipped from one 
stable orientation to the other by means of a torque pulse applied to the 
gimbals. 

The results pertaining to the two-gyro system given in Section II serve 
as an outline of the analysis required for the design of any gravity-gradi- 
ent attitude control system. They also serve as an introduction to the 
theory in Sections III and IV. In these parts we present several results 
and methods that we feel apply generally to the design of many-parame- 
ter, linear dynamical systems (see Section III) and to large-angle mo- 
tions of a satellite (see Section IV). 

Specifically, in Section JII we develop various bounds on system set- 
tling time, and then show how series expansions in terms of system 
parameters can be used to explore the behavior of a linear system as a 
function of its parameters. We next describe a computer program based 
on the Routh criteria, which allows very rapid computation of system 
response as a function of system parameters. By these means, we are able 
to survey system behavior over the entire range of six system parameters. 

In Section IV, we develop the equations of large-angle motion, includ- 
ing the case of variable inertia, occurring during rod erection. Here we 
stress the superiority of direction cosines or Euler parameters as com- 
pared to Kuler angles in satellite kinematics, both from the point of view 
of computing speed and of ease in visualizing satellite motions. We then 
give the analysis of equilibrium positions, despin, and flipping or in- 
version. 
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II. SYSTEM DESCRIPTION AND SUMMARY OF RESULTS 


2.1 Gravity-Gradient Attitude Control Systems 


All gravity-gradient systems have one feature in common, namely the 
low magnitude of the gravity-gradient restoring torque, of the order of 
Ia?, where J is a typical satellite moment of inertia and Q is the orbital 
rate. The level of this torque is the main factor determining the steady- 
state response to constant and periodic disturbing torques. In particular, 
in the case of a typical communications satellite at an altitude of 6000 
nm, the magnitude of the torque exerted by the geomagnetic field on the 
residual magnetic moment of the satellite is such that the satellite inertia 
must be increased by a factor of about forty to reduce the steady-state 
response to an acceptable level. 

The low level of the gravity-gradient restoring torque also implies 
low system natural frequencies, of the order of the orbital rate Q. Corre- 
sponding to this low natural frequency is a minimum 1/e settling time 
of the order of a fraction of an orbit. Zajac!‘ has shown that all the sys- 
tems mentioned above have pitch settling times no less than about one 
tenth of an orbit. This, of course, is a lower bound on minimum settling 
time for three-axis motion. 

Based on these simple considerations, we would expect that all well 
designed gravity-gradient attitude control systems would have about the 
same transient and steady-state performance, that they would all have 
settling times of a fraction of an orbit, and that they would all require 
some form of inertia augmentation to obtain acceptable steady-state 
response. Thus the choice of a particular gravity-gradient attitude con- 
trol system should be based mainly on ease of mechanization and long- 
time reliability, rather than system performance. 

In the present case, at least, requirements on the large-angle per- 
formance of the system (despin, satellite inversion, etc.) preclude choos- 
ing the system parameters to give minimum settling time, although the 
settling time is not greatly increased by meeting the other requirements. 
It is likely that such a compromise would be necessary for optimum over- 
all performance of any gravity-gradient system, so that the minimum 
settling time is of academic interest only. Of more importance is the 
variation of system performance with variation in system parameters. 
We have thus taken the view that a broad survey of performance as a 
function of system parameters is of more interest than an optimization 
based on a single measure of system performance, e.g., settling time. 

In the following sections we describe the configuration and perform- 
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ance of the two-gyro system in detail. The interested reader may find the 
corresponding theoretical analyses in Sections III and IV. 


2.2 System Description 


Fig. 2 shows the more important features of the typical single-axis 
gyro indicated schematically in Fig. 1. The basic element of the gyro is 
a rotor which spins rapidly about the spin axis and generates a certain 
angular momentum vector. 

The spinning rotor element is enclosed in a sealed gimbal can, mounted 
on bearings so that it can rotate about a single axis, the gimbal or output 
axis. A fluid-filled gap between gimbal can and gyro case provides damp- 
ing as the gimbal rotates. 

In the system considered, the two gyros have their gimbal axes along 
the satellite roll axis. The gyro spin axes are disposed in a vee configura- 
tion around the satellite pitch axis, which is also the axis about which the 
satellite rotates to remain aligned with the local vertical as it traverses its 
orbit. To distinguish this arrangement from other possible two-gyro con- 
figurations,?"° it will be called a “‘roll-vee” configuration. 

In the vee arrangement, torques must be supplied constantly to change 
the direction of the gyro angular momentum vectors, as the satellite 
traverses its orbit. These torques, constant in magnitude and exerted 
about the gimbal axes, are provided by a constant electrical signal into 
electromechanical torquers on the gimbal axes. 

It is also possible to inject a signal into the torquers on ground com- 
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Fig. 2 — Single-axis gyro. 
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mand. This can be used to invert the satellite if it should get into an un- 
desirable equilibrium position. This possibility is discussed in the sequel. 

In order to spin the gyro motor, current must be brought into the gim- 
bal can. This is done by means of highly compliant flex leads. In the pres- 
ent application, the flex-lead spring constants, exerting a small restoring 
torque around the gimbal axis, can be neglected. However, for a typical 
communications satellite without inertia augmentation, the flex-lead 
torques can be of the same order as the gravity-gradient torques. 

In any case the gimbal excursions must be limited by suitably placed 
stops. The location and nature of these stops is an important design con- 
sideration. In the first place, undesired equilibrium positions, with the 
gyro gimbals against the stops, may occur if the stop positions are not 
carefully chosen. In the second place, large tumbling rates may force the 
gimbals against the stops, where they are capable of only limited relative 
motion, depending on the stop elasticity. In both cases the available 
damping may be greatly reduced. The equilibrium positions may be 
dealt with analytically, while the large motion may be studied numeri- 
cally with the stops simulated by hardening springs. 


2.2.1 Weighi and Power Requirements 


For the attitude control of a typical communications satellite in a 
6000-nm altitude orbit, we require two single-axis gyros, each with a 
rotor angular momentum of about 10° cgs units, weighing about 10 
pounds and requiring from 7 to 10 watts power to drive the rotor motor. 
In addition we require some sort of inertia augmentation which we shall 
assume is supplied by a single extensible 60-foot rod of the STEM 
(self-storing tubular extensible member) type, designed and developed 
by DeHavilland Aircraft of Canada, Ltd., and described in detail in 
Ref. 8, together with a 20-pound tip mass, which also serves as the tape 
storage drum. We then have the attitude control system weight break- 
down given below: 


2 10° cm-gm-sec gyros 20 lbs 
1 tip mass 20 Ibs 
1 extensible rod 4 Ibs 
gyro power supply 
(2 Ibs of solar cells/watt) 40 lbs 
total 84 Ibs 


We have assumed that the satellite proper is a four-foot diameter sphere, 
weighing 300 pounds, with a moment of inertia of 20 slug-ft?. 

It is believed that the above estimates are quite conservative and sub- 
ject to considerable reduction. The power is used to maintain the gyro 
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rotor speed constant mainly against bearing drag. In a zero-g environ- 
ment the bearing drag might be substantially reduced. In any case there 
is probably a trade-off between gyro life, requiring heavily lubricated 
bearings, and minimum rotor power, requiring light lubrication. 

The rod length is chosen to increase the satellite pitch and roll inertias 
from 20 to 2000 slug-ft?. This inertia augmentation sufficiently despins 
the satellite from currently estimated injection rates of 0.5-1.0 rpm to 
cause capture by the gravity-gradient field. The required inertia aug- 
mentation varies roughly linearly with initial injection rates (see Section 
2.4.1). With a sufficiently small injection rate, the augmented inertia 
could be reduced to the 700 slug-ft? level required to counter magnetic 
torques (see Section 2.3.4.1). Such a reduction in inertia would mean 
smaller gyros, and, again, less power. 


2.3 Small-Angle Performance 


In order to study the small-angle transient and steady-state response 
of the roll-vee gyro attitude control system, extensive tables giving decay 
rates, response to orbital eccentricity, and response to periodic torques 
at zero, one, and two times orbital frequency Q as functions of the system 
parameters were produced by an IBM 7090 computer in a running time 
of 0.04 hour by a procedure outlined in Section III. Figs. 3 through 15 
summarize this broad survey. For each pair of inertia ratios, B/A, C/A, 


y s= ! ORBIT 
0.78 / 102 1.16 





Fig. 3 — Asymptotic settling time in orbits (reduction of 1/e). 
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where A,B,C, are the satellite pitch, roll, and yaw moments of inertia, 
satisfying the inequalities 


AZB2C, B+C2ZA, 


values of gyro parameters were chosen from these tables to minimize the 
asymptotic settling time, i.e., the time in which the most lightly damped 
mode of motion is reduced by 1/e. Fig. 8 shows the corresponding 
settling times, while Figs. 4 and 5 give the gyro dimensionless parame- 
ters 


h = (H/AQ) cos a, h’ = (H/C)p) cos a, 


where a is the vee half-opening angle, H the gyro angular momentum, 
and C'p the gyro damping constant for both gyros. Since the small roll- 
yaw motion depends only on H in the form H cos a, the above is a con- 
venient choice of parameterization. In all cases, except those indicated, 
the best value of a was 60°, at least over the relatively coarse grid of 
Ah = 0.25, Ah’ = 0.25 and Aa = 20° used in the tables. 

Figs. 6 through 15 give the steady-state response to an orbit eccen- 
tricity « = 0.01 and to periodic torques of amplitude 0.01 A for the 
same values of gyro parameters. Note that the eccentricity response when 
e = 0.01 is of the order of 1° over the entire range of inertia ratios, hav- 
ing a maximum value of less than 3°. Both the pitch offset, due to a con- 
stant pitch torque, and the roll amplitude, due to a periodic roll torque 
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Fig. 4 — Gyro parameter h = H cosa/AQ. 
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Fig. 5 — Gyro parameter h’ = H cosa/Cp. 


at orbital frequency, depend only on the satellite inertias, being given in 
radians by the simple relations 


| ee | = M/3(B = Cle, | ey | 1= M/3(A = Ce, 


for a torque of amplitude AZ. Similarly, for torques at frequency w > Q, 
the pitch, roll, and yaw amplitudes tend to the values M/Aw’, M/Bo”, 
M/C«w”, again independent of the gyro parameters. 


B/A 
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Fig. 6 — Pitch amplitude (degrees) for eccentricity e = 0.01 at orbital frequency. 
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0.19 0.21 0.24 0.27 0.32 0.38 0.48 0.64 0.95 1.91 


B/A 





Fig. 7 — Pitch offset (degrees) for constant pitch torque 0.01 A2?. 


2.3.1 The Minimum Settling Time 


These plots do not show the values of inertia ratios and gyro parame- 
ters which yield the smallest settling time. A search over a finer grid of 
parameter values gives a minimum value of settling time of 0.332 orbits, 
attained for B/A = 0.925, C/A = 0.175, h = 0.260, h’ = 0.688, a = 64°. 
To attain this value, a slightly negative gimbal spring K = —0.15 HQ 
cos a must be used. A negative spring constant may be realized by a sim- 
ple feedback circuit between gimbal pickoff and gimbal torquer. This 





C/A 


Fig. 8 — Roll offset (degrees) for constant roll torque 0.01AQ?. 
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0.38 0.57 0.57 0.57 0.57 0.57 0.57 0.57 0.57 0.57 





Fig. 9 — Yaw offset (degrees) for constant yaw torque 0.01A0?. 


minimum settling time is useful as a lower bound, but of more practical 
interest is the broad range of system parameters over which settling 
times of less than one orbit can be obtained. 


2.3.2 The Spindle 


The figures also do not give performance values for a “dumbbell” or 
“spindle,” i.e., a body for which A = B >> C. This ease is of particular 


B/A 





Fig. 10 — Pitch amplitude (degrees) for pitch torque amplitude 0.01AQ? at 
orbital frequency. 
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B/A 





Fig. 11 — Roll amplitude (degrees) for roll torque amplitude 0.01AQ? at orbital 
frequency. 


interest, since it may be realized by the erection of a single rod-tip mass 
combination. To describe the spindle, as well as to give a sample of the 
tables made by the computer, we reproduce the computer output for 
B/A = 1, C/A = 0.01 in Table I. Since the IBM printer has only a 
limited range of symbols, the following replacements were used: 


BB=b=B/A, CC =c=C/A, 
KAPPA = « = 1 + [K/(H® cos a)], 


where K is the gimbal spring constant, so that x = 1 means zero gimbal 
spring constant, 


HH =h, HP = NW’, 
ALPHA = a. 


The remaining quantities give the transient and steady-state responses. 
In particular, PO, Pl, P2, RO, Rl, R2, YO, Y1, Y2 are the pitch, roll, 
and yaw amplitudes in degrees for pitch, roll, and yaw torques of ampli- 
tude 0.01 AQ’ at zero, one, and two times orbital frequency. Note that 
PO and R1 are constant, since they depend only on b and c, while RO, YO 
are fixed for fixed values of h. The quantity E is the pitch amplitude in 
degrees at orbital frequency for an orbit eccentricity « = 0.01. Finally 
the quantities labeled “QUINT” and “C”’ give the real parts of smallest 
magnitude of the characteristic roots of the roll-yaw quintic and the 
pitch cubic in terms of the orbital rate Q. The smallest of these values 
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Fig. 12—- Yaw amplitude (degrees) for yaw torque amplitude 0.01AQ? at 
orbital frequency. 


D (say) determines the asymptotic settling time 7, = 1/(2 xD). Inspec- 
tion of the table reveals that, for h = 0.750, h’ = 1.25, a = 40°, we have 
the smallest settling time, for QUINT = —0.340(2), —0.657(2), ie., two 
roots with real parts —0.340 and two roots with real part —0.657, and 
one negative real root (not listed) of larger magnitude. Similarly, in this 
case C = —0.279(2), 1.e., two roots of the cubic with real part —0.279 
and one unlisted negative real root of larger magnitude. The asymptotic 





Fig. 13 — Pitch amplitude (degrees) for pitch torque amplitude 0.01AQ at 
twice orbital frequency. 
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B/A 





Fig. 14 — Roll amplitude (degrees) for roll torque amplitude 0.01AQ? at twice 
orbital frequency. 


settling time is then given by 1/[(27) (0.279)] = 0.57 orbits. Despite 
the coarseness of the table, this is very close to the minimum value of 
0.50 orbits for a spindle, attained for h = 0.77, h’ = 1.29, a = 38°. This 
minimum value may be calculated by an asymptotic expansion in the 
large quantity h/c = (H cos a)/CQ. 

Let us now attempt a specific “design.”’ This design must be regarded 





Fig. 15 — Yaw amplitude (degrees) for yaw torque amplitude 0.01A0? at twice 
orbital frequency. 
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as illustrative, rather than definitive, since a real design must take into 
account the fine details of gyro hardware as well as requirements imposed 
by the use of the satellite in an actual communications system. For ex- 
ample, it is not at all clear what limits on maximum settling time would 
be imposed by system requirements. We have tentatively set this maxi- 
mum settling time at one orbit. 


2.3.3 Transient Response for a Spindle. 


Fig. 16 shows the asymptotic settling time in orbits as a function of 
the dimensionless gyro angular momentum H/AQ for a = 40°, A’ = 1.25 
and for a = 60°, h’ = 1.00. The former gives a minimum settling time 
very near the optimum value for a spindle for H/AQ nearly unity, but 
varies more rapidly with H/AQ than does the other system. Also, we are 
particularly interested in large values of H/AQ—i.e., H/AQ > 2—since 
we propose to use the gyros as inertia wheels in the initial despin of the 
satellite after injection into orbit. In this case the second system gives a 
considerably smaller settling time (0.85 orbits, compared with 1 orbit, 
at H/AQ = 2). We may actually increase H/AQ to about 2.4 in this case 
and stay within the maximum settling time of 1 orbit. Undoubtedly, by 
trimming the values of a and h’, we may increase H/AQ even more, but, 
since this is intended to be an illustrative design, we do not consider 
these questions further here; instead, we simply take as our “design” 
a = 60°, h’ = h = 1.00 (4/AQ = 2.00). In the illustrative examples of 
the sequel, these parameter values will be assumed. From the table, they 
yield 


QUINT = —0.189(2), —1.318(2), 
C 


—0.190(2). 


Given the real parts of 4 roots of the roll-yaw quintic and 2 roots of 
the pitch cubic, it is a simple matter to calculate all the characteristic 
roots completely, especially for a spindle. In the present case we find 
solutions of the form 


re sin = 
e 0.19 Q¢ (0.642), e 6 er 
cos 


for the pitch motion, and 


019M sm (1.4008), Pts sin (0.532), gore 
cos cos 


CCL 


TasBLE I] — CompurrerR OUTPUT FOR SPINDLE SHAPE (A 








BB = 1.000, CC = 0.010, KAPPA = 1.000 








40.0 
60.0 
80.0 


HH=0.250 


PO= 0.19 RO= 0.14 YO =1.15 
HP=0.500, QUINT =—0.113(2), —0.576(1) 
ALPHA Ri= 0.19, R2= 0.46, Yl= 1.48, Y2= 0.35 
20.0 Pl= 0.29, P2= 0.55, E= 0.59, C= —0.015(2) 


0.30 
0.32 
0.09 


0.49 0.65.  —0.080(2) 
0.29 0.92  —0.337(2) 
0.04 1.35 —0.170(2) 


HP =0.750, QUINT =—0.150(2), —0.863(2) 


ALPHA R1= 0.19, R2= 0.56, Yl= 


1.62, Y2= 0.35 


20.0 Pi= 0.29, P2= 0.54, E= 0.60, C= —0.020(2) 


40.0 
60.0 
80.0 


0.31 
0.36 
0.07 


0.44 0.71 —0.109(2) 
0.22 1.22  —0.408(2) 
0.02 1.37 —0.110(2) 


HP=1.000, QUINT =—0.159(2), —0.582(2) 
ALPHA R1= 0.19, R2= 0.69, Yl= 1.79, Y2= 0.35 


20.0 P1= 0.30, P2= 0.53, E= 0.61,C= 


40.0 
60.0 
80.0 


0.32 
0.40 
0.06 


—0.024(2) 
0.40 0.76 —0.124(2) 
0.17 1.52 —0.364(2) 
0.02 1.38  —0.082(2) 


HP=1.250, QUINT =—0.149(2), —0.437(2) 
ALPHA R1i= 0.19, R2= 0.82, Y1= 1.99, Y2= 0.35 _ 
20.0 Pi= 0.30, P2= 0.52, N= 0.62, C= —0.027 (2) 


40.0 
60.0 
80.0 


0.32 
0.45 
0.05 


0.37 0.80 —0.130(2) 
0.15 1.83  —0.297(2) 
0.02 1.38  —0.066(2) 


HH =0.500 
PO= 0.19 RO= 0.14 YO= 0.57 
HP=0.500, QUINT =—0.118(2), —0.474(1) 
R1= 0.19, R2= 0.23, Y1= 0.82, Y2= 0.18 
Pl= 0.29, P2= 0.54, E= 0.61,C= —0.030(2) 


0.31 0.42 0.73. —0.160(2) 
0.31 0.17 1.26 —0.637(2) 
0.04 0.02 1.25 —0.088(2) 


HP=0.750, QUINT =—0.184(2), —1.170(2) 


R1= 0.19, R2= 0.28, Yl= 0.88, Y2= 0.18 

P1= 0.30, P2= 0.52, E= 0.63, C=—0.041(2) 
0.33 0.35 0.85 —0.212(2) 
0.34 0.12 1.71 —0.456(2) 
0.03 0.01 1.25 —0.058(2) 


HP=1.000, QUINT =—0.250(2), —0.750(2) 
R1= 0.19, R2= 0.34, Y1= 0.96, Y2= 0.18 
P1= 0.30, P2= 0.50, E= 0.64,C = —0.049(2) 


0.35 0.29 0.97 —0.231(2) 
0.36 0.10 2.13 —0.309(2) 
0.03 0.01 1.26 —0.044(2) 


HP=1.250, QUINT =—0.277(2), —0.516(2) 
R1i= 0.19, R2= 0.41, Y1= 1.06, Y2= 0.18 
P1= 0.30, P2= 0.48, E= 0.66,C = —0.052(2) 


0.36 0.26 1.07 —0.226(2) 
0.39 0.08 2.48 —0.237 (2) 
0.02 0.01 1.26 —0.035(2) 


= B>C) 


HH =0.750 
PO= 0.19 RO= 0.14 YO= 0.38 
HP =0.500, QUINT =—0.101(2), —0.410(1) 
R1= 0.19,R2= 0.16, Y1= 0.61, Y2= 0.12 
P1= 0.30, P2= 0.52,E= 0.62,G=—0.450(2) 


0.32 0.35 0.81 —0.239(2) 
0.27 0.12 1.43 —0.687 (2) 
0.03 0.01 1.22 —0.059(2) 
HP=0.750, QUINT = —0.161(2), —0.804(1) 


R1= 0.19,R2= 0.19, Y1= 0.65, Y2= 0.12 
P1= 0.30, P2= 0.49, H= 0.65, C=-0. 061 (2) 


0.34 0.28 1.01. —0.307(2) 
0.26 0.08 1.78  —0.336(2) 
0.02 0.01 1.22  —0.039(2) 
HP=1.000, QUINT = —0.236(2), —1.017(2) 


R1= 0.19, R2= 0.23, Y1= 0.69, Y2= 0.12 
P1= 0.31, P2= 0.47,E= 0.68,C=—0.072(2) 


0.37 0.23 1.21 —0.312(2) 
0.25 0.07 2.00 —0.236(2) 
: 02 0.01 1.22 —0.029(2) 
HP =1.250, QUINT =—0.340(2), —0.657(2) 


R1= 0.19, R2= 0.27, Y1= 0.75, Y2= 0.12 
P1= 0.31, P2= 0.44, E= 0.70, C=-0. 076(2) 


0.40 0.20° 1.39 —0.279(2) 
0.25 0.06 2.14 —0.184(2) 
0.02 0.01 1.22 —0.023 (2) 





PO= 0.19 RO= 0.14 YO= 0.29 
HP=0.500, QUINT = —0.085(2), —0.365(1) 
ALPHA R1= 0.19, R2= 0.12, Y1= 0.50, Y2= 0.09 
20.0 P1= 0.30, P2= 0.51, E= 0.63, C= —0.061 (2) 


HP=0.750, QUINT =—0.133(2), —0.645(1) 
ALPHA R1= 0.19, R2= 0.14, Y1= 0.53, Y2= 0.09 
20.0 P1= 0.30, P2= 0.47, E= 0.67, C= —0.082(2) 


HP=1.000, QUINT = —0.189(2), —1.318(2) 
ALPHA R1= 0.19, R2= 0.17, Y1= 0.56, Y2= 0.09 
20.0 Pl= 0.31, P2= 0.44, E= 0.71,C=—0.095(2) 


&626 


HP=1.250, QUINT = —0.266(2), —0.933(2) 
.19, R2= 0.20, Y1= 0.60, Y2= 0.09 


TaBLeE I — continued 





HH =1.250 
PO= 0.19 RO= 0.14 YO= 0.23 
HP=0.500, QUINT = —0.071(2), —0.329(1) 
R1= 0.19, R2= 0.09, Yl= 0.44, Y2= 0.08 
P1= 0.30, P2= 0.50, E= 0.65, C= —0.076(2) 


0.32 0.26 0.99  —0.393(2) 
0.19 0.08 1.47  —0.361(2) 
0.02 0.01 1.19  —0.036(2) 


HP=0.750, QUINT = —0.110(2), —0.556(1) 
R1= 0.19, R2= 0.11, Y1= 0.46, Y2= 0.08 
P1= 0.31, P2= 0.45, E= 0.70, C= —0.102(2) 


0.36 0.19 1.33 —0.453(2) 
0.16 0.05 1.61 —0.218(2) 
0.01 0.01 1.19  —0.023(2) 


HP=1.000, QUINT = —0.152(2), —0.990(1) 
Ri= 0.19, R2= 0.14, Y1= 0.48, Y2= 0.08 
P1= 0.32, P2= 0.41,E= 0.74,C=—0.118(2) 


0.41 0.15 1.70 —0.371(2) 
0.14 0.04 1.67.  —0.159(2) 
0.01 0.00 1.19 —0.018(2) 


HP=1.250, QUINT = —0.202(2), —1.199(2) 
R1= 0.19, R2= 0.16, YI= 0.51, Y2= 0.08 
P1= 0.32, P2= 0.38, E= 0.78, C= —0.123(2) 


0.46 0.13 2.07  —0.291(2) 
0.13 0.03 1.70  —0.126(2) 
0.01 0.00 1.19  —0.014(2) 


HH =1.500 

PO= 0.19 RO= 0.14 YO= 0.19 
HP=0.500, QUINT = —0.060(2), —0.301 (1) 
R1= 0.19, R2= 0.08, Y1= 0.40, Y2= 0.06 
P1= 0.30, P2= 0.48, E= 0.66, C= —0.091 (2) 


0.32 0.23 1.08 --0.467 (2) 
0.16 0.06 1.45 —0.298(2) 
0.01 0.01 1.18 —0.080(2) 


HP=0.750, QUINT = —0.091(2), —0.494(1) 
R1= 0.19, R2= 0.10, Y1= 0.41, Y2= 0.06 
P1= 0.31, P2= 0.43,E= 0.72,C=—0.122(2) 


0.36 0.17 1.47 —0.489(2) 
0.13 0.04 1.54 —0.186(2) 
0.01 0.00 1.18 —0.020(2) 


HP=1.000, QUINT = —0.125(2), —0.786(1) 
R1= 0.19, R2= 0.11, Y1= 0.43, Y2= 0.06 
P1= 0.32, P2= 0.38,E= 0.78,C=—0.140(2) 


0.41 0.13 1.90 —0.361(2) 
0.11 0.03 1.58 —0.1387(2) 
0.01 0.00 1.18 —0.015(2) 
HP=1.250, QUINT=-—0.161(2), —1.441(2) 
R1= 0.19,R2= 0.14, Y1= 0.45, Y2= 0.06 
P1l= 0.33, P2= 0.35,E= 0.83,C=—0.144(2) 
0.46 0.11 2.33 —0.278(2) 
0.10 0.03 1.60 —0.108(2) 
0.01 0.00 1.18 —0.011(2) 
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Fig. 16 — Asymptotic settling time in orbits versus gyro angular momentum. 


for the roll-yaw motion. Note that the period of the oscillatory solutions 
is comparable to the orbital period, as we would expect, and that both 
motions have rapidly damped exponential solutions. The latter feature, 
typical of spindle-shaped bodies, causes difficulties in numerical integra- 
tion of the differential equations, both for small and large motion, for 
it implies that derivatives may be very much larger than the dependent 
variable itself. 


2.3.3.1 Micrometeorite Impact. One source of transient disturbance is 
the angular momentum imparted by micrometeorite impact. It was 
estimated in Ref. 8 that for a satellite of comparable inertia level, im- 
pacts producing offsets greater than 5° would occur every two years and 
impacts large enough to tumble the satellite every 23 years, on the aver- 
age. A more recent study™ of the present two-gyro system indicates 
similar times if Whipple’s 1958 micrometeorite data are used. For 
Whipple’s 1963 data, the corresponding 5° and tumbling times are 40 
years and 1000 years. From a systems point of view, the low frequency 
of occurrence of these disturbances suggests that a settling time of 1 
orbit is quite adequate. 
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2.3.4 Steady-State Response 


Disturbances producing constant or periodic pointing errors may be 
classified as either kinematic or dynamic. The response to the former, of 
which orbit eccentricity is a typical example, is essentially independent 
of satellite inertia; the response to the latter type of disturbance may 
be reduced simply by increasing the level of satellite inertia to a suitable 
value. The kinematic response limits the minimum attainable pointing 
error; the dynamic response to given disturbing torques sets the level 
of satellite inertia. 

In the case of the spindle, the table yields an eccentricity response 
amplitude at orbital frequency of # = 1.81° for an orbit eccentricity 
e = 0.01. We also find the steady-state response amplitudes due to 
torque amplitudes of 0.01A” given by 


PO = 0.19°, Pl = 0.18°, P2 = 0.05°, 


RO = 0.14°, R1 = 0.19°, R2 = 047°, 
YO = 0.29°, Y1 = 0.56°, Y2 = 0.09°. 


We are particularly interested in the pitch offset PO, due to a constant 
pitch torque, and the roll amplitude R1, due to a roll torque at orbital 
frequency. Both of these are independent of gyro parameters and equal 
in the case of a spindle. Together with a given disturbing torque, they 
serve to set the satellite inertia level. 


2.3.4.1 Magnetic Torque — Satellite Inertia Level. In the case of a 
communications satellite, one of the principal disturbing torques is the 
torque exerted by the geomagnetic field on the residual magnetic mo- 
ment of the satellite. It has been estimated in Ref. 8 that this torque 
might be as large as 5 X 10 ° ft-lb for a satellite like the Telstar satellite 
at an altitude of 6000 nm. At this altitude Q = 2.73 X 10% rad/sec 
(~1 rad/hr). Because of the steady rotation of the earth-pointing 
satellite, this torque does not have a constant pitch component, but it 
will have a roll component at orbital frequency. Thus R1 is the response 
amplitude of interest. To make R1 equal to the eccentricity response of 
1.81° requires a satellite pitch moment of inertia A such that AQ’ is ten 
times the above torque, yielding A = 670 slug-ft?. Since a typical 
moment of inertia for a satellite somewhat larger than the Telstar 
satellite is 20 slug-ft”, this calculation indicates that some sort of inertia 
augmentation is required. We shall assume that the satellite proper has 
equal moments of inertia 49 = Bo = Co = 20 slug-ft” and that the pitch 
and roll moments of inertia are increased to 2000 slug-ft” by the erection 
of a single 60-ft extensible rod and a tip mass of 20 pounds. As indicated 
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in Section 2.2.1, we make the inertia somewhat larger than the minimum 
required to counter magnetic torque in order that rod erection may be 
used for satellite despin. 


2.3.4.2 Solar Radiation Pressure. When rods are erected to augment the 
satellite inertia, the perturbing torque due to incident solar radiation is 
in general increased, but not at the same rate as the inertia, if a thin rod 
and a dense tip mass are used. To give some idea of the order of mag- 
nitude of this torque, let us consider a 2-foot radius, 300-pound satellite, 
joined to a 0.2-foot radius, 20-pound tip mass by a 58-foot-long, 0.04- 
foot diameter rod. This yields a maximum solar torque of about 2 * 10-° 
ft-lb (0.0075 AQ for A = 2000 slug-ft?) around the center of mass of the 
system, which lies about 4 feet from the center of the satellite proper. 
This low torque is the result of a partial balance between the resultant 
force on the satellite and the resultant force on the rod, both yielding 
torques of the order of 5 X 1078 ft-lb (0.019 AM’). Even using this figure, 
the deflection due to solar pressure will be no larger than that due to 
magnetic torque. Thus such a satellite need not be especially designed to 
balance out solar torques. 


2.3.4.3 Bending Due to Solar Heating. In Ref. 8 the bending of an ex- 
tensible rod of the STEM type, due to differential solar heating, was 
analyzed. Further unpublished work by P. Hrycak and by J. G. Eng- 
strom at Bell Telephone Laboratories leads to the formula 


d/L = (L/4r)xaTo/[« + 4 + 168/3], 


for the deflection d of a rod of length Z, radius r, and expansion coeffi- 
cient a, where 


(aoS/eoo)*, 
tkhT)/r’aS, 6B = e:/eo, 


T 


I 


K 


with ao and é the rod external absorptivity and emissivity, e; the rod 
internal emissivity, o the Stefan-Boltzmann constant, S the flux of 
solar radiation through unit area in unit time, & the rod thermal con- 
ductivity, and h the rod wall thickness. The dimensionless quantity «x 
gives the ratio between heat transferred by conduction and by ra- 
diation. Typical values of the above quantities are: 


L = 60 ft, r= 0.02ft, h=2x 10 “ft, 
S = 442 Btu/ft”-hr, o = 171 X 10” Btu/ft-hr-(°R)*, 
k = 65 Btu/ft-hr-°F, a = 10°°/°F, 

ao = 0.67, o=—& = 0.33, 
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the last five values being appropriate for beryllium copper. In this case 
Ty = 635°R, « = 219, B = 1, and d/L = 0.0656. Note that in this case 
k > 1, so that conduction effects dominate. Unless 8 > 1, i.e., the out- 
side of the rod is highly reflecting and the inside “black,’’ we may use 
the simpler formula 


d/L = (L/4r) (ar’aoS/kh), 


obtained by neglecting the remaining terms in the denominator of the 
previous formula in comparison with x. In the present case this yields 
d/L = 0.0684, compared with the more exact value of 0.0656. 

The above displacement d is the displacement of the tip mass at the 
end of the rod and hence produces a corresponding rotation of the yaw 
principal axis of inertia through an angle of order d/L = 0.0656 = 3.7° 
and an antenna pointing error of the same size. Note that this angle 
increases linearly with L, so that thermal bending sets an upper limit 
on the length of a given type of rod which may be used for inertia aug- 
mentation. In observations of the Applied Physics Laboratory 1963 
22A satellite,” thermal bending manifested itself apparently as a high- 
frequency oscillation of the satellite’s attitude, attributed to the rapid 
heating of an extensible rod on passage from shadow into sunlight. 


2.4 Large-Angle Motion 


We shall discuss various large-angle motions of the gravity-oriented, 
gyro-stabilized satellite in order of their occurrence. First we consider the 
injection, despin, and capture of the satellite in orbit and the equilibrium 
positions into which it may settle. Next we discuss the use of the gyros 
to flip the satellite in case it settles into the inverted equilibrium position. 
Finally we report the results of computer studies of various large motions. 


2.4.1 Satellite Despin 


We assume that the satellite is injected into a nearly circular, 6000-nm 
orbit with an initial spin rate of less than 1 rpm around an arbitrary axis. 
After injection, erection of a single 60-foot rod with a 20-pound tip mass 
then increases the moment of inertia around axes normal to the rod from 
20 slug-ft? to 2000 slug-ft? and decreases the spin rate around these axes 
by a corresponding factor, e.g., from 250 rpo (revolutions per orbit) to 2 
rpo. The component of spin around the rod axis is, of course, unaffected 
by rod erection. This component of spin is removed by uncaging the 
gyros from their nominal equilibrium position, in which they have a 
zero net component of angular momentum around the rod axis (the 
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body yaw axis) and allowing them to precess toward the spin rate vector. 
The net change of yaw angular momentum due to this precession is of 
the order of the gyro angular momentum H = 2 AQ, where A is the final 
moment of inertia (2000 slug-ft?) around the body pitch and roll axes 
normal to the rod, and the angular momentum due to the initial spin 
around the rod axis is 250 CQ = 250 A,Q = 2.5 AQ, of the same order of 
magnitude. Note that this latter despin is in proportion to the difference 
of angular momenta, rather than their ratio, so that we might expect 
difficulties with the small differences of large numbers, leaving us with a 
sizeable angular velocity around the body yaw axis. However, the yaw 
component of angular velocity rapidly settles out; it is the yaw angular 
momentum, rather than the yaw angular velocity, which is of impor- 
tance. 

This is shown in Figs. 17-18 where, for the design of Section 2.3.2, 
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Fig. 17 — Despin during boom erection. 
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Fig. 18 — Despin to capture following boom erection. 


yaw, pitch, and roll rate, obtained by a digital computer, are plotted 
against time. At ¢ = 0, the satellite was assumed injected into the de- 
sired orientation, with gyros at the null position, and with a yaw rate of 
250 rpo (approximately 2 rpm). The elapsed time of Fig. 17 is two min- 
utes, corresponding to boom erection. In this time, the yaw rate de- 
creases to 20 rpo, while pitch and roll first peak at —12 rpo and —6 rpo 
respectively and then decay to —2.5 rpo at the end of boom erection. 
Subsequently, as shown in Fig. 18, all three rates decrease to less than 1 
rpo at the end of 1 orbit. 


2.4.2 Hquilibroum Positions 


Four equilibrium positions, in which the satellite is stationary with re- 
spect to the rotating local vertical, may be found by inspection. Two of 
these, shown schematically in Fig. 19, are the stable roll-vee positions 
with the gyro angular momentum vectors making a symmetrical vee 
with the orbit pitch axis (normal to the orbit plane), the gyro gimbal 
axes along the orbit roll axis (tangent to the orbit track), and the rod 
along the local vertical. The satellite antenna in this case is either di- 
rected toward the earth or away from the earth. We discuss the inversion 
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Fig. 19 — Equilibrium positions, stable roll-vee. 


of the satellite from the latter position in the sequel. Two other equi- 
librium positions are yaw-vee positions, Fig. 20, with the gimbal axes 
along the orbit yaw axis, i.e., the local vertical, and the gyro angular 
momentum vectors again making a symmetrical vee with the orbit pitch 
axis, and the rod along the orbit roll axis. These two positions are un- 
stable, however, just as they would be without the gyros. 

Other equilibrium positions occur because of the presence of the gyro 
gimbal stops. Suppose, for example, that the satellite is rotated around 
the local vertical through 180° from its normal operating position. The 
gyro gimbal torquers which normally hold the gyro vee open against 
the 1 rpo steady precession of the satellite in orbit, now act with the 
precession to force the gyro gimbals against stops located near the body 
yaw axis. The resulting symmetrical reverse vee configuration (see Fig. 
21a) is a possible satellite equilibrium position. Although the satellite 
antenna is still directed toward the earth in this position, it is an unde- 
sirable equilibrium position, because, when the gyro gimbals are against 
stops, their damping capability is severely reduced. This reversed equi- 
librium position can be made unstable by moving the gimbal stops in 
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(a) (b) 


Fig. 20 — Equilibrium positions, unstable yaw-vee. 





(a) (b) 


Fig. 21 — Equilibrium positions, reverse-vee and skewed. 
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from the yaw axis and by choosing the gyro angular momentum to have 
a suitable value, as discussed in Section IV. The inversion of the position 
shown in Fig. 21(a) and the corresponding reversed yaw-vee positions 
are unstable as before. 

Finally we note the possibility of skewed equilibrium positions, in 
which the body principal axes do not coincide with the orbit axes and 
both gyro gimbals are against stops (see Fig. 21b). Examples are dis- 
cussed in Section 4.3. Such unsymmetrical equilibrium positions may 
be easily eliminated by appropriate choices of stop positions and gyro 
angular momentum, but their occurrence suggests the necessity of a 
thorough investigation of equilibrium positions for any attitude control 
system, especially one in which constraints due to stops are present. 
The investigation of equilibrium positions also may serve as a guide in 
singling out lightly damped modes of large motion. 


2.4.3 Satellite Inversion 


As we have already noted, the satellite may be captured, after injec- 
tion into orbit, in inverted position with its antenna pointing away from 
the earth. With sufficiently large gyros it may be flipped from this posi- 
tion by changing the net gyro angular momentum by means of a simple 
signal injected by ground command into the gyro gimbal torquers. We 
simply reverse the polarity of the bias signals into the gyro torquers for 
a preset short time interval. The resulting change in angular momentum 
is just enough to cause the satellite to tumble, so that it is captured 
again in its normal operating position. Fig. 22 shows the result of such 
an inversion procedure, where the polarity is switched for 3 orbit. Here, 


COSINE OF ERROR ANGLE 





_ Fig. 22 — Error angle versus time during satellite inversion. Torquer polarities 
interchanged for 1/2 orbit. 
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we have plotted the cosine of the error angle, i.e., the angle between the 
body yaw axis and the local vertical. 


2.4.4 Computer Runs for Large-Angle Motions 


Since only a limited amount of energy may be imparted by initial 
displacement of the satellite, computer studies were directed at the ef- 
fects of high initial angular velocities. In Figs. 23-26 are shown some 
sample results of computer runs for the response of the satellite design 
of Section 2.3.2 to high initial rates, applied to the satellite in the stable 
roll-vee orientation. These may be regarded as responses to micro- 
meteorite impacts, or as representative of initial transients following 
inadequate despin. 

To save space, we again plot, as a function of time, only the cosine 
of the error angle between the yaw axis and the local vertical. However, 
the orientation of the satellite and of the gyro spin vectors are shown 
every half orbit in computer-made perspective drawings of a rectangular 
parallelepiped representing the satellite. The view is along the orbital 
track in the rotating, earth-pointing reference frame, so that the local 
vertical and normal to the orbital plane are in the plane of the paper. 
Plus signs are placed on the faces of the parallelepiped to avoid optical 
illusions. The gyro stops are indicated by dots. The reader may find 
more details about these drawings, as well as a description of computer- 
made movies showing large motions of the two-gyro satellite, in Ref. 15. 

As is seen from Figs. 23-26, rates of the order of 4 rpo about pitch 
and roll damp out in about 10 orbits, whereas yaw rates of even 100 rpo 
settle out in about 5 orbits. In roll and yaw, the settling time and motion 
are similar if negative rather than positive rates are applied. The re- 
sponses to positive and negative pitch rates are, however, different in 
character. A high positive pitch rate collapses the gyros toward the 
pitch axes, and a slowly decaying, essentially single-axis spin ensues. A 
high negative pitch rate opens up the gyros and drives them against the 
yaw stops. This sends the satellite into a complicated tumbling which 
eventually settles out. 

If a micrometeorite of linear momentum m strikes the satellite at a 
lever arm L from a principal axis with moment of inertia J, the angular 
velocity w imparted around that axis will be » = mL/I. This velocity 
varies directly with L and inversely with J. For the design of Section 
2.3.2, the yaw and pitch or roll lever arms are in the ratio 2/60, while the 
inertias are in the ratio 2000/20. A micrometeorite which imparts a pitch 
or roll rate of 4Q will impart a yaw rate of (2/60) - (2000/20) -49 = 
13.38Q. Therefore we see from Figs. 23-26 that the two-gyro spindle satel- 
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Fig. 24 — Response to +4 rpo pitch rate. 


GQaOMLILIV ALITIGLVs 


TOULNOD 


THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


2736 


Ot 6 8 





‘ayer youd oda F— 04 asuodsaxy — CZ “BLT 


SLIguO 
Z 9 S v € 2 | O- 


| | N\ IL A 





7 z + ; | ros 
i ey hey bE 


ATISNV YOUNS AO ANISOD 


SATELLITE ATTITUDE CONTROL 2737 





0.5 








COSINE OF ERROR ANGLE 


' 
° 
wo 


“1.0 


ORBITS 


Fig. 26 — Response to 100 rpo yaw rate. 


lite is considerably more resistive to micrometeorite impact about yaw 
than about pitch and roll. 

It is well known that gravity-gradient satellites will tumble if placed 
in a sufficiently eccentric orbit. Computer experiments showed that for 
the design of Section 2.3.2 this occurred at an eccentricity of about 0.2. 
Computer results for « = 0.225 are shown in Fig. 27. 


III. SMALL-ANGLE MOTION 


3.1 Satellite Configuration 


To settle the vexing questions of nomenclature and sign convention 
once for all, we commence with a brief description of the quantities char- 
acterizing a gravity-oriented satellite moving in a circular orbit (in- 
cluding the effect of small eccentricity later on) at the orbital angular 
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Fig. 27 — Tumbling for orbital eccentricity of 0.225. 
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velocity 2. A convenient number to remember, to fix the magnitude of 
Q, is that an orbit at an altitude of about 6800 statute miles corresponds 
to an orbital rate Q = 1 radian/hr and an orbital period of 27 or about 
6 hours, 15 minutes. 

For our purposes the satellite is described by its principal moments 
A = B 2 C about principal axes 2’,y’,z’, along which the principal unit 
vectors i’,j’,k’ lie. These principal axes form the body pitch, roll, and 
yaw axes, respectively. 

When the satellite is in a position of stable equilibrium, the 2’,y’,z’ 
body axes coincide with orbit pitch, roll, and yaw axes x,y,z as in Fig. 
19, with corresponding unit vectors i,j,k, normal to the orbital plane, 
along the orbit track, and along the local vertical toward the center of 
the earth. These orbit reference axes rotate at the orbital rate 2 (1 rpo) 
about the orbit pitch axis. It should be noted that, although a spindle- 
shaped body, formed by the extension of a single rod and tip mass, is 
shown, for which B } A and C < B, the small-angle analysis which 
follows covers the whole range of inertias, given by the inequalities 


A>B>C, 
required for stability, and 
B+C> A, 


imposed by rigid-body geometry. 

For small perturbations from equilibrium the satellite orientation is 
specified by the small pitch, roll, and yaw angles ¢,, g,, ¢., through 
which the body axes 2’,y’,z’ are rotated from the orbit axes x,y,z, as in 
Fig. 28. The corresponding satellite angular velocity vector w,;, with 
respect to inertial space, is given by 


Os = i(Q ae er) ai Jey = kg. , 
with respect to orbit axes, or 
Os = i’(Q sag x) j'(—-Q¢, = ey) “fe k’ (Q¢, a z); 


with respect to body axes. 


3.2 Roll-Vee System Equations for Small Motion 


By neglecting second-order terms in the dynamical equations of Sec- 
tion IV, as indicated in Section 4.2.2, we obtain the satellite equations 
of motion 
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Fig. 28 — Definition of small pitch, roll, and yaw angles. 


Ag. + 3(B — C) Mg, + 2H, = 0, 
Boy + [4(A — C)Q + 2H Mo, 
+ (A — B— C)Q + 2H Je. + 2HOMg, = 0, (1) 
Ce. + [(A — B)® + 2H Me, 
— [(A — B — C)Q + 2H |e, — 2H, = 0, 
for the satellite pitch, roll, and yaw angles ¢,, gy, ,¢.. The sum g, = 


3(¢y, + ¢9,) and difference y, = 3(¢% 9, — ¢,) of the two gimbal angles 
satisfy the equations 


Cov, + (Kk + HQ), H.¢; = 0, 
Ce RE Roe Eee eS, = 0. 


Here H, = H cosa and H, = H sin a. This is an eighth-order linear 
system of equations for ¢, , gy, ¢z,;Wg,¢ 9, Which splits immediately 
into a cubic pitch system for ¢g, and y, and a quintic roll-yaw system 
for gy , 2 , g , Since the pitch motion depends only on the out-of-phase, 
or “scissoring,”’ motion of the gyro gimbals, given by the difference 
angle y, , and the roll-yaw motion depends only on the in-phase gimbal 
motion, given by the sum angle g, . 
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These equations can be reduced to dimensionless form by setting 
p = (1/Q)d/dt, b = B/A, ec = C/A, h = (H/AQ) cos a, 
h’ = (H/Cp) cosa, x = 1+ [K/(A2 cos a)], 


yielding the two sets of equations: 


Pitch: 
[p° + 3(b — c)]oz + (2h tan a)py, = 0, 
— (h’ tan a)pe, + (p + «h’)y, = 0, 
Roll-Yaw: 
[bp’ + 4(1 — c) + 2hly, + (1 — b — c + 2h)py, + 2he, = 0, 
—(1—b —c + 2h)py, + [ep’ + (1 — b) + 2hly. — 2hpy, = 0, 
h'gy + h'pe. + (p + kh')e, = 0. 


If we insert appropriate terms on the right-hand sides of these equations 
to include the effect of given initial conditions and perturbing torques, 
we may regard the above systems as the Laplace transforms of the origi- 
nal set of differential equations, with transform variable p. The solution 
is then found by solving this set of linear, algebraic equations for ¢, , ¥, , 
etc., now interpreted as Laplace transforms, and calculating the residues 
at the poles of these functions of p. The transient response is entirely 
determined by the location of these poles and by the specific initial 
conditions. The steady-state response to a periodic perturbing torque 
at frequency NQ may be determined by inserting constant right-hand 
sides, in general complex, setting p = 7N, and solving for the amplitudes 


| Px li | Wo l5 ete. 


3.3 Transient Response 


For given initial conditions and zero perturbing torques, the trans- 
forms are rational functions of p, with the characteristic pitch and 
roll-yaw polynomials as denominators, given by 


Pitch Cubtie: 
filp) =p tap +tapt+e, (2) 
Roll-Yaw Quintic: 
fs(p) = p’ + axp’ + agp’ + asp’ + asp + as, (3) 
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where 


¢, = h'(x + 2h tan’ a), CG = 3(b— Cc), cz; = 3(b — c)kh’, 


and 
PN ee al 
c 
2 
cise b+ 2h , 4 ec) + 2h, O b c+ 2h) 
c b be 
pease (2 + 2) — 8¢ = 2h) 
be 
(1 — b + 2h)(4 — 4c + 2h) 
ag = 
be 
; 2hh'(1 — b + 2h) 
as = «hay — ———~ 


For stability, it is necessary that all the roots of the above polynomials 
have negative real parts. In particular, the magnitude of the real part 
of the root nearest the imaginary axis determines the rate of decay of the 
most lightly damped mode of motion. If this real part is —D, we can 
define a 1/e asymptotic settling time 7, = 1/2xD (in orbit periods) and 
use 7’, as a measure of transient response, particularly suited for use with 
a digital computer. In Section 3.6 we discuss the determination of D as 
a function of b, c, a, h, h’, and x. Once it is reduced to a suitably small 
value by some choice of system parameters, the short-time transient 
response can be determined by solution of the differential equations with 
specific initial conditions and the system parameters readjusted, if 
necessary. Actually systems chosen on the basis of minimum asymptotic 
settling time seem to have quite adequate short-time, as well as steady- 
state, response. 


3.4 Steady-State Response 


The steady-state response to periodic perturbing torques at frequency 
NQ, determined as previously outlined, is given in various cases by the 
following relations: 


Pitch amplitude for pitch torque AQ: 
lee | = (N? + @h”)*/| fai) |, (4) 
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Roll amplitude for roll torque AQ: 


_ 2 
lev| = | (= 2478 aw — an) 


1—b+2h = ) 
+ (Lobby — w)T / pew 
Yaw amplitude for yaw torque AQ: 


=e. , 2 
le.| = Ca kh’ — an — tN’) 


b 
4 — 4c + 2h ail | 
4 (4-4 ty 2 ny | / el foN) , 
Roll amplitude for yaw torque AQ’ and yaw amplitude for roll torque AQ: 


levl = lenl =| (L224? an’ — a) 


+ (atocth v) | /'b fi) |, 


(6) 


(7) 


where, by (2) and (3), 
| fo(iN’)? = (cs — cN*)’ + (@N — N°)’, 
| fs(iN)[? = (a3 — asN? + a,N*)? + (a,N — aN* + N°)? 


In particular, a constant pitch torque AQ’(N = 0) gives the constant 
pitch offset 


| v2 [o = 1/[8(b — c)], 
while, for a roll torque of amplitude A® at the orbital frequency (N = 1) 
le | = 1/[8(1 — ¢)). 


These amplitudes, independent of the gyro parameters, limit the mini- 
mum permissible satellite inertias for given perturbing torques. 

Finally, an elliptic orbit of small eccentricity « induces forced pitch 
vibrations at the orbital frequency 2 with amplitude 


lee le = 2e(1 + e1)*/[(es — 1)? + (e — 1)°I4 (8) 


By straightforward differentiation, it is easily shown that the eccentricity 
response | ¢, |. has a single maximum as a function of the gyro opening 
angle a. For a larger than the value at which the maximum is attained, 


2744 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


the eccentricity response decreases monotonically, approaching 2¢ as a 
approaches 90°. 


3.5 Bounds on the Asymptotic Damping Rate D 


As mentioned in Section 3.3, a convenient single measure of transient 
response is the parameter D, the distance from the imaginary axis of the 
right-most root of the characteristic equations. One would like to know 
D as a function of the system parameters, D = D(b,c,a,h,h’,x). In gen- 
eral, this function is impossible to determine analytically and must be 
computed numerically. In order to limit such computations to ranges of 
the parameters b,c,a,h,h’,« that give reasonable values of D, it is con- 
venient to have bounds on D. 

One set of bounds is given by the following theorem:”° 

If the coefficients qo, q,-°+: , of a polynomial P(p) are positive, 


P(p) = gp" + up” +++: +4 
then D ts bounded by 


n 
er ot (i) k=1,2,-+,n 


k>1, 


= Je L=0,1,-:: 
qi l 


We note that in both f;(p) and fs(p), in (2) and (8), go = 1. Hence by 
the above theorem with | = 0, if the system parameters are such that 
any of the coefficients ¢ , ¢2 , C3 , @1, °** , @5is small, then D will be small. 

Likewise, if any coefficient in f3(p) or fs(p) is large compared to a 
subsequent coefficient, then the theorem tells us that D will again be 
small. 

We note also that b — ¢ < 1, so that the theorem applied to c2 gives, 
in pitch, : 


Dsb-ce 


IIA 


1, 


ie., the asymptotic settling time 7’, in pitch for the roll-vee system is 
bounded by 7; 2 1/27 = 0.159 orbit. (This is slightly larger than the 
corresponding bound 7; = 5*/2+/3r = 0.137 orbit obtained in Ref. 17 
for a two-body satellite. ) 

From these bounds we conclude immediately that at best D can be 
of order unity, and to get a D of this order of magnitude the coefficients 
and ratios of coefficients in f3(p) and fs(p) must be at least of order 


unity. 
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Another useful bound is obtained by shifting the origin in the p plane 
to p = —D and applying the Routh criteria (see Section 3.7), to the 
shifted f3(p) polynomial. It is then found that one of the terms in the 
Routh array is 


r = —2D2° + [8D*? + 3(b — c)]x — 8D° — 6(b — c)D — 3(b — c) xh’, 
where 
x = kh’ + 2hh’ tan’ a. 


To have all roots to the left of the line Re p = —D, r must be positive. 
But it is easily verified that r > 0 only if 


3 (b —c) 
aap 
DE5% kh? 


which gives an additional bound on D. 





3.6 Determination of D by Series Expansions 


When the coefficients in f3(p) or fs(p) are either large or small, D 
can sometimes be expanded in a power series around a known root. 
This again restricts the parameter ranges over which D must be deter- 
mined numerically. For example, suppose h’ is small. The roots of f3(p) 
at h’ = Oare p = 0, p = +4iv/3(b — c). However, it is well known” 
that each of the three branches of the triple-valued function p = p(h’) 
is analytic in h’. Expanding around h’ = 0, say for p(0) = i+/3(b — c), 
we have 


Ss 
I 


iva +N (BP) to 
in/3(b — c) —Whtan’a+---, 


with similar expressions easily obtained for the other two branches of 


p = ph’). 
A particular case of interest is that of a spindle-shaped body. In 
this case, c > 0, b — 1, and the coefficients a1, ---, as of fs(p) all be- 


come large. One can then consider the equation cfs(p), in which the 
leading coefficient is small. However at c = 0, this equation is singular 
because it is reduced in degree from a quintic to a quartic. The quar- 
tic, with coefficients ca1,caz, ++, cas, gives only four of the limiting 
roots as c — 0. The fifth limiting root is however easily found by setting 
p = a/c, yielding fs*(c) = cfs(c/c). Application of the expansion the- 
orem to fs*(c) then yields the fifth limiting root p > —2hh'/c asc > 0. 
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This root gives a highly damped mode, and has a real part far in the 
left half-plane. The roots of interest in finding D are thus those of 
the limiting quartic: 


fp) = p* + (B:/h')p*® + Bop’ + (B3/h’)p + Bs = 0, 


where 


B= 2h+1, Bb 
Bo = 2h + 4, Bs 


The limiting quartic fs(p) is a function of the three parameters x,h,h’, 
whereas, in this limiting case, the cubic f3(p) is a function of x, h, h’, and 
a. It turns out that D = D(h,h’,x,«) can be obtained graphically. Fur- 
ther, D,, , the maximum D for all possible h,h’,x,a, can be found and has 
the value D,, = 0.317, attained at the values 


0.77 <h < 0.78, h’ = 1,29, xk = 0.92, a = 38°. 


OR PD) ede oh. 
K(2h + 4) — 2h. 


(The value D,, corresponds to an asymptotic settling time 
T, = 1/2cD,, = 0.502 


orbits.) However, the description of the graphical technique and the 
derivation of D,, are too lengthy for inclusion here. 


3.7 Computation of the Over-All Small-Angle Response 


The over-all small-angle performance of the satellite attitude control 
system is characterized by its steady-state response to constant and 
periodic disturbances (solar torques, magnetic torques, orbital eccen- 
tricity) and by its transient response to sporadic disturbances (initial 
injection, micrometeorite impact). In proper design, one wants to 
diminish the response to all disturbances to below a suitably small level. 

The steady disturbances have their main components at zero, orbital, 
and twice orbital frequency. As indicated earlier, their amplitudes may 
be diminished by inertia augmentation with extensible rods. Fortunately, 
it is easy to write down the formulas, (4)-(8), for satellite response to 
steady disturbances, and also easy to program these formulas for digital 
computation. 

The computation of the transient response is not so straightforward, 
even in terms of the single measure D. An interesting theoretical problem 
is to find the maximum D as a function of all system parameters. Gradi- 
ent or steepest descent methods, which first come to mind for the 
solution to this problem, seem to be difficult to apply, since the maxi- 
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mum D usually occurs in the neighborhood of multiple roots where the 
function D = D(b,c,h,h’,x,a) is singular. 

However, although this is a theoretically interesting problem, its solu- 
tion is not of great practical importance, as indicated in Section 2.1. It 
is more important to have a cheap method of computing D. A method 
that we have found useful involves the Routh criteria as follows: 

Write the polynomial f(p) as 


f(p) = agp” - ap" ap eae np” +e +a = 0, 
and form the Routh array 


Go, 2,4, °°" y 
bo, bo, ba, **°, 
Co, 2, Cay, °°" y 


do, d2,ds4,°--, 
where 
bo = ay, be = Ogre" , box = G2i41, °°" 3 
and 
Coy = Aei+e — (b2i4200/bo), 
dy; = boise — (Cors2bo/co), ete, 7 = 0,1, 2,---. 


Then the number of sign changes in the sequence ao, bo, co, do, ete. 
(providing no term is zero) is the number of roots in the right-half plane. 
Because of its recurrence structure, this scheme is easily programmed ona 
digital computer. 

To determine the real parts of the roots of f(p), one applies the scheme 
to a succession of translated half-planes as follows. If p = —D + ¢, then 


(C=Daes) 
7 2 (n) ¢ __ n 
GSD iss. DE 


pA n!} 
= go + ae + $a = 0, 


where it is easily verified that 


= f(—D) + f'(—D)& + 


s (=D) 
me a 
_ f(D) | 


qi (n _ 1)! —nayD + an, 
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o(—D) = 
a = n 2 n 1 
= aD” — aD + a 
ee CS) (2) ( |) aD tee, 


_ i? (-D) 
Gee 


= () ay(—D)* + @ _ 7 a(—D)*? + +++ + ay. 


The Routh array applied to the coefficients qo, m1, --- , qx then indi- 
cates the number of roots to the right of the line Rep = —D (Re¢ = 0). 
In order to locate the real parts of the roots to an arbitrary degree of 
accuracy, one applies this array on a sequence of nested intervals. For 
example, start with some large D = D* such that the Routh array ap- 
plied on Re p = —D* indicates roots to its right. Take as the initial 
interval —D* < Rep < 0. Inastable system there will be roots between 
the right boundary (Re p = 0), and the left boundary (Re p = —D*). 
Next apply the Routh criteria on Re p = —D*/2. There are two possi- 
bilites: (a) if there are no roots to the right of Re p = —D*/2, make 
this the new right boundary; the interval —D* < Re p < —D*/2 now 
has the same properties as the initial interval, (b) if there are roots to 
the right of Re p = —D*/2, make this line the left boundary of the new 
interval —D*/2 < Re p < 0, which again has the same properties as 
the initial interval. By applying this process n times, one ends up with 
an interval of width D*/2”, which contains roots but has no roots to its 
right. The accuracy of the location of the real parts of the roots closest 
to the imaginary axis can be set by prescribing the width of the final 
interval. Since the widths of the successive intervals go down as 1/2”, 
the process converges rapidly. 

After the real parts of the roots closest to the imaginary axis are found 
within some interval of desired width, say Interval 1, the same procedure 
can be used to find the next closest roots to the imaginary axis. One 
starts again at some sufficiently large D*, such that some roots fall to 
the right of Re p = —D* and to the left of Re p = —Dzu1, the left 
boundary of Interval 1. One makes these the left and right boundaries 
respectively, of an initial Interval 2, and applies the nested interval 
iteration again. The right boundary of Interval 2 in each iteration is 
characterized by having m roots to its right, where m is the number of 
roots contained in Interval 1. 

The starting value D* can be chosen in various ways. If one is inter- 
ested only in the roots closest to the imaginary axis, he can pick D* as 
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| @/G) | 
jem / (2) 


for then (see Section 3.5) there will be at least one root to the right of 
Re p = —D”*. If it is desired to find the real parts of all the roots, D* 


can be chosen as 
am 
D* = max — 
Qmn-1 


since it is well known™ that this value of D* is a bound on the modulus 
of the maximum root and hence all the roots will be to the right of 
Re p = —D*. 

We remark that this procedure may be easily extended to a method 
for finding both the real and imaginary parts of the roots of a real 
polynomial. It is only necessary to use well-known relations between the 
imaginary parts of the roots and certain members of the Routh array. 

The above scheme goes rapidly on the IBM 7090 computer. For ex- 
ample, if the widths of Interval 1, Interval 2, etc. are set at 0.005, the 
running time is about 1000 cases a minute to find the real parts of all 
the roots of both the quintic, f;(y), and the cubic, f3(p). Tables caleu- 
lated by this process were used in making the parameter survey whose 
results are summarized in Section 2.3. 


IV. LARGE-ANGLE MOTION 


4.1 Introduction 


The large-angle motion of the satellite is of course governed by non- 
linear differential equations, which in general must be integrated numer- 
ically. Nevertheless, a few analytical and intuitive insights are available. 
These are pointed out in the sections which follow. 

We begin with a discussion of the pertinent dynamical and kinematic 
equations, including the effect of variable inertia, due to rod erection. 
Then we enumerate the equilibrium positions of the satellite, in which it 
is at rest with respect to the orbiting reference frame in a circular orbit, 
and show that certain restrictions must be placed on gyro angular mo- 
mentum to eliminate undesired positions. This is followed by a discus- 
sion of satellite despin by the erection of a single rod and tip mass. 
Finally we show how the satellite may be inverted by ground command 
to the gyro torquers. 
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4.2 Large-Angle Equations of Motion 


In the following, we make an explicit distinction between dynamical 
equations, valid in any coordinate system when written in proper vector 
form, and kinematic relations between various specific coordinate sys- 
tems. This allows us to introduce a minimum number of different co- 
ordinate systems and to avoid a good deal of irrelevant algebraic com- 
plexity. 


4.2.1 Dynamical Equations 


The rate of change of angular momentum L about the satellite center 
of mass, with respect to a reference frame rotating at the satellite angular 
velocity w; , is governed by the equation 


L+o,XL=M, (9) 


where M is the resultant torque around the center of mass, the sum of 
the gravity-gradient torque M<, the total gyro precession torque M, , 
and the external disturbing torque M, . For a rigid body 


L =I-a, (10) 


where I is the inertia dyadic, given in terms of the principal moments of 
inertia A > B > C and corresponding principal vectors i’,j’,k’, by 


I = Ai’/i’ + Bj'j’ + Ck’k’. (11) 
If w is the satellite angular velocity relative to orbit reference axes, 
wo, = iv + o, (12) 


where i is a unit vector normal to the orbit plane and y(t) is the polar 
angle of the satellite center of mass, measured from orbit perigee in 
earth-centered coordinates and satisfying the orbit equation 


y = 1 + ecosy)?/(1 — 2)3, (13) 


where ¢ is the orbit eccentricity and Q = 22/7), To being the orbit 
period. The gravity-gradient torque Mg is given by 


Mg = 30°(1 + ¢€ cosy) ‘[k X (I-k)J/(1 — &)°, (14) 


where k is a unit vector directed along the local vertical toward the 
center of the earth. Here and in the following, we consider only what 
Beletskii”® calls the “restricted problem” for which the motion of the 
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center of mass is given by (13) and is unaffected by the motion around 
the center of mass. Finally the resultant gyro torque for a two-gyro, 
roll-vee configuration is 


M, = Hi X Wg, + H, X Wyo 5 (15) 


where H,’s are the gyro angular momenta, of fixed magnitude H, and 
the w,,’s are the gyro gimbal angular velocities. In terms of the gimbal 
angles g,, and the nominal roll-vee half-opening angle a, we have 


Wg, = Os + j'o5; ’ (16) 
Hi = H{i’ cos (a — ¢,,) + k’ sin (a — ¢,)] (17) 
H, = H{i’ cos (a + g,,) — k’ sin (a + g,,)]. (18) 


The set of dynamical equations is completed by the gimbal equations of 
motion. If the gyro gimbals are not against stops, these are 


Cnda; a K¢o; = M,, a j- (A; x @g;), (19) 
where Cp is the gyro damping constant, K the gimbal spring constant, 
and the constant bias torques M/,, are given by M,, = —M,, = HQ sina. 


When the gimbals are against stops, the reaction torques from the 
stops on the gimbals must be added to (19). 


4.2.2 Kinematic Relations 


The orientation of the satellite body axes x’,y’,z’, or the corresponding 
unit vectors i’,j’,k’, with respect to the orbit axes x,y,z or corresponding 
unit vectors i,j,k,“ may be specified in a number of ways. In classical 
dynamics, Euler angles have been traditionally used. They specify a 
rigid body’s orientation with a minimum set of three numbers, and, in 
some of the soluble problems of rigid body dynamics, lead to straight- 
forward analytical manipulations. 

From a computing point of view, Euler angles, however, have three 
serious disadvantages: (1) they involve trigonometric functions, which 
are expensive to compute, (2) they are singular when the nutation 
angle is zero, and (3) they are difficult to use in the visualization of 
complicated motions. We have chosen to use the so-called Euler parame- 
ters, rather than the Euler angles. A set of variables, perhaps even more 
suitable for the matrix algebra typical of modern computer programming, 
might be the direction cosines a,8,y, etc., satisfying the relations 


* See Section 3.1. 
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I 


lo*by a + kia’, 
j= 16 + je + kB", (20) 
k = i’y + jy’ + k’y”. 

These a’s should not be confused with the nominal vee opening angle, 


which we shall distinguish from the direction cosines, whenever they 
are used together, with a subscript. By using identities of the form 


kt+oXxXk=0, (21) 


satisfied by i, j, and k, we may obtain 9 equations giving the rates of 
change of the direction cosines in terms of the direction cosines and the 
components of w. We would then have 15 equations, including 3 satel- 
lite equations of angular motion, 1 equation of motion for the satellite’s 
mass center, 2 gimbal equations of motion, and 9 equations for the 
direction cosine rates, yielding the 3 components of w, the satellite polar 
angle y, the 2 gimbal angles, and the 9 direction cosines. The identities 


a ae B + 7 = ab gl? +e” = 1, ete, 
aB + a’B’ + a’B” = aa’ + BB’ + yy’ = 0, ete., 


i 


which must be satisfied initially, would then serve as checks on the 
numerical solution. Incidentally, it should be noted that the cosine of 
the antenna pointing error angle is given by the direction cosine y”, 
between the local vertical and the body yaw axis. 

We shall use the direction cosines to study equilibrium positions, but 
Euler parameters in the study of general satellite motion, since they are 
simply related to the deflection angles for small motion. If we assume 
that the (2’,y’,z’)-axes are formed by rotation of the (2,y,z)-axes through 
the angle @ around an axis with direction cosines mz , my , m., the Euler 
parameters & , & ,&, x are defined by the relations 


(Ex, &y, &) = (mz, my, mM.) sin (0/2), x = cos (6/2). 
We now have” 
i= i'(E, — &) — & + x’) + 27’ (Eby — xb.) + 2K’ (Exe. + xby), 
j = (ety + x6) + (-& + & — & +x) 
+ 2k’ (EyE. — xée), (22) 
k = 2i’(£.t, — xéy) + 25’ (eve. + xée) 
+k(-& -—§/) +8 4%’), 
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giving the direction cosines, and 
Qi. = xwr + Eyer — Ewy, 
2b, = xwy + Ewe — Ewe, 
2b. = xwe + Exwy — Eyoe, 
—2X = Ewr + bywy + E02, 


completing a set of 10 equations for the three components of angular 
velocity w, two gimbal angles, four Euler parameters, and the polar 
angle y. Just as in the case of the direction cosines, the single identity 


e+e +e tx = 1 
serves as a check on the numerical solution. Also note that, for small 


rotation 6, 2&, ™ Pry 2ky ™ Py; 2&, ™ G25 X™ 1, War ~ Pry Wy ™ by, 
Wy ™ %,, and 


(23) 


i~i-je+ky,, 
j~iet+ ij — k’e., 
k ~ -i'g, + Jo. + ¥, 


where ¢; , gy , ¢: are the small pitch, roll, and yaw angles. If these rela- 
tions are inserted into the dynamical equations and second-order terms 
neglected, the linear equations for the small motion, (1), are obtained. 

In coding the differential equations for the digital computer, it was 
found convenient to define cross-product and dot-product subroutines: 


A X B = (—A3B, + AoB;, A3Bi — AyB3, —A2B, + A1B2), 
A-B a A,B, -- AB, + A2B3 . 


This allowed the coding to follow closely the vector form of (9)-(11), 
which was useful from the standpoint of both coding simplicity and 
debugging. 

For nondumbbell satellite shapes, say b = 0.9, c = 0.5, the five-point 
predictor-corrector with 3 rule as given by Hamming (Ref. 22, Chapter 
15) was used. However, in the spindle case, b = 1.0, c = 0.01, the dif- 
ferential equations become singular because the small number c multi- 
plies a derivative, and the five-point, } rule scheme was found to be 
very slow. Following a suggestion of R. W. Hamming, a simple three- 
point predictor-corrector scheme (Ref. 22, p. 186) was then tried. It 
turned out to be three to four times faster than the five-point scheme 
and to give about the same accuracy. 
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In the computer runs, the gyro stops were simulated by hardening 
springs. For example, for g,, > 8, the normal spring restoring torque of 
Kg,, was replaced by 





Kya, + Ble, — B+ 5_ > 


where B, C, 8, and @ are constants, to simulate the pitch stop of the first 
gyro. The same expression but with different constants was used for the 
yaw stop. Specifically, in all the computer runs for the spindle-shaped 
satellite reported here, K = 0 and the pitch-stop values for Gyro 1 were 


B=50A%, C=001A9%, B=58° 06 = 60° 


Yor the yaw stop, 8 and 6 above were replaced by 6 = —20° and @ = 
—30°. Corresponding, symmetrical stop constants were used for 
Gyro 2. 


4.2.3 The Rate of Change of Energy 


For a circular orbit we can easily obtain a useful expression for the 
rate of change of kinetic and potential energy, relative to orbit axes. 
We take the scalar product of the satellite equation of motion (9) with 
the relative angular velocity w and combine it with the two gyro equa- 
tions, (11), multiplied by ¢g,, . After some routine algebra, we obtain 
the relation 


(d/dt)(T + V + @) = —Co($o," + G02); 

where the relative kinetic energy 

T = jo0-l-o, 
the potential energy 

V = 30°(3k-I-k — i-I-i), 

and the gyro energy 

G=G+G, 
with 

G; = 4Kg,, + M,,¢,, — 9-H. 


This expression is useful in the estimation of various quantities, in par- 
ticular the velocity required to tumble the satellite, and conditions 
necessary for capture.” 


SATELLITE ATTITUDE CONTROL 2755 


4.3 Satellite Equilibrium Positions 


When the satellite moves in a circular orbit in such a way as to remain 
stationary with respect to the local vertical, its motion satisfies the 
equilibrium equation 


Mi X (I-i) = 30°k x (I-k) + 0H xX i, 


where His the resultant gyro angular momentum. For a symmetrical 
satellite 


I= A(ii’ + jj’) + Ck’ 
and the above equation yields the relations 
(A — C)Bry" = 0 
H-j = HA, = (A — C)Qa’ 8", (24) 
H-k = H, = 4(A — C)Qa"y”, (25) 


with a” = i-k’, 8” = j-k’,y” = k-k’, as defined by the table of direction 
cosines, (20). Thus we have the following general result: 


I. The equilibrium positions of any symmetrical, gravity-ortented, gyro- 
stabilized body in a circular orbit must be such that the principal axis of 
least inertia and the resultant angular momentum are perpendicular either 
to the orbit roll axis (8” = H, = 0) or to the orbit yaw axis (y” = H, = 0). 


In the case of a roll gyro system, with all gimbal axes parallel to the 
body roll axis, the resultant gyro angular momentum must have the 
form 


H=i/H, + k'H,, 
so that 

H, = BH,» + B"H,, 

H, = yHy + y"H,. 
Thus, 8” = H, = 0 implies BH, = 0, andy” = H, = 0 implies yH, = 
0. If we now assume that the motion of the gyro gimbals is restricted by 


stops along the body yaw axis, so that H, > 0, an assumption appropri- 
ate for the case of the two-gyro roll-vee, we have the following result: 


II. The equilibrium positions of any symmetrical, gravity-oriented, roll- 
gyro-stabilized body in a circular orbit must be such that the body roll axis 
is either parallel to the orbit roll axis (B’ = +1) or parallel to the orbit 
yaw axis (y’ = +1). 
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So far we have not made use of the gyro gimbal equilibrium equations. 
If the gimbals are not against stops, from (19) for the roll-vee, these 
take the form 


Qj’-(Hi X i) — HX sin a — Kyg,, = 0, 
Qj’: (BH, X i) + HQ sin a — Ke,, = 0, 


where we now denote the nominal gimbal angle by ao to avoid confusion 
with the direction cosines. If the flex-lead constraint is negligible, i.e., 
Kk = 0, these two equations imply that 


y'H, — BH, =0 


which in either case in II (6’ = +1, H, = Oory’ = +1, H, = 0) 
implies that H, = Hz = 0, so that no torque is exerted on the satellite 
by the gyros. We then have: 


III. The equilibrium positions of a symmetrical, gravity-oriented, free 
roll-vee-gyro-stabilized body must be such that the resultant gyro torque 
vanishes (i X H = 0) and either the body pitch, roll, and yaw axes are 
parallel to the orbit pitch, roll, and yaw axes (! = +i, j’ = +j, Rk! = 
+k) or the body pitch, roll, and yaw axes are parallel to the orbit pitch, yaw, 
and roll axes, respectively (i! = +i, j’ = +k, k’ = +)). 


The signs of course must be chosen so that the above represents a 
proper rotation. Note that the above applies to any roll gyro system for 
which the resultant torque around the body roll axis exerted by the 
gyros on the satellite vanishes. If 7’ = +7, the second set of equilibrium 
positions gives Fig. 20, with the gyro gimbal axes along the orbit yaw 
axis in a yaw-vee configuration. Small pitching motion around these 
equilibrium positions is governed by a characteristic equation of the 
same form as that for the roll-vee, (1), except that the coefficient 
3(B — C)/A > 0 is replaced by 3(C — B)/A < 0. Thus these equi- 
librium positions are unstable. 

The equilibrium position i’ = i, j’ = j, k’ = k of the first set is shown 
in Fig. 19(a). It corresponds to the normal operating position with the 
body yaw axis, on which the antenna is situated, directed toward the 
earth. The inverted position (see Fig. 19b) i’ = i, / = —j,k’ = —kis 
also stable, since it merely corresponds to an interchange of the two 
gyros. This bistability is characteristic of gravity-oriented bodies and a 
gravity-oriented communications satellite must either use two antennas, 
with associated switching, or incorporate some means of flipping the 
satellite in response to ground command. The latter possibility is 
discussed in some detail in the sequel. 
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The reversed roll-vee equilibrium positions Fig. 21(a), with i’ = —i, 
remain to be investigated. The corresponding yaw-vee positions are still 
unstable. If the gyro gimbals were completely free, satellite precession 
and the bias torques, now acting together, would rotate the gyro gim- 
bals from the reversed roll-vee until they formed a normal roll-vee 
around the orbit pitch axis. But with gimbal stops, making the angles 
--a”™ with the body pitch axis, the gimbals rotate until the stop reaction 
torque and the bias torque sustain the 1 rpo steady precession of the 
satellite in orbit. The stability of this reversed roll-vee position can be 
investigated by using the characteristic equations for the normal roll-vee, 
with H cos ao replaced by —H cos a’ and a large spring constant K*, 
introduced to take the stop compliance into account. In particular the 
coefficient 


a, = [(A — B)Q + 2H cos al[4(A — C)Q + 2H cos aol/BCOQ, 
in the ikea characteristic equation, is replaced by 
as’ = [(A — B)Q — 2H cos a’ |[4(A — C)Q — 2H cos a*|/BCO. 
If 
(A — B)Q < 2H cosa’ < 4(A — C)Q, 


this is negative and the equilibrium position is unstable. 

The instability of the reversed roll-vee when a’ satisfies the above in- 
equalities is shown in Fig. 29. In this case, the system parameters are 
the same as those of the sample design in Section 2.3.2 with a* = 80°. 
Initially the gyros are against the stops and the satellite has rates of 0.05 
rpo about all three axes. It is seen that the satellite turns around the 
yaw axis and settles down to rest in the desired orientation in less than 
five orbits. 

When the gyro gimbals are against stops, the gyros exert a torque on 
the body and in general there are other, skewed equilibrium positions. 
To investigate these positions without getting involved in the details of 
stop compliances, etc., which depend on the specific gyros used, we con- 
sider only two idealized cases, the first with stops along the positive and 
negative body yaw axes but with no stops along the body-pitch axis, and 
the second with stops along the pitch axis as well as along the yaw axis. 

In both cases the gyro spin axes may be back-to-back along the yaw 
axis, but this is a case of zero net gimbal torque already treated and is 
easily eliminated by moving the gimbal stops in slightly. In the first case 
both spin axes may lie along the body yaw axis against stops, so that 


Hi=H.=kH, H=2kH 
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Mie: 29 — Instability of reversed roll-vee when gyro stops are suitably dis- 
pose 


From (24) and (25) we again have two cases to consider: (a) B” = Hy = 
0, and (b) y” = H, = 0. In case (a) we have 


H, = 2y"H = 4(A — C)Qa"y”. 


The subcase 6B” = H, = 0; 7” = 0 is easily shown to be unstable, so 
there remains only the position given by 


” — H/(2(A — C)Q]. 


Unless H > 2(A — C)Q, this yields an equilibrium position which can 
be maintained by the stop reaction torques. These torques are of course 
one-sided, since the stop can only “push” and not “pull.” This undesira- 
ble skew equilibrium position can be eliminated by making 


H > 2(A — C)Q. 
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A similar position for case (b) (y” = H, = 0) can be eliminated by satis- 
fying the less restrictive condition H > (A — C)0/2. 

The corresponding situation with yaw and pitch stops finds one gyro 
against the yaw stop and the other against the pitch stop (see Fig. 21b). 
For example, suppose 


H, = k’H, H, = iH, H = H(i’ + k’). 
Now in case (a), 6B” = H, = 0, we have 
H, = H(y + 7”) = 4(A — C)Qa"y”, 
or, in terms of the angle @ between the x and 2’ axes, 
sin 20 = —[H/2'(A — C)Q] sin [6 + (2/4)]. 


The two roots of this equation in the interval —7z/4 < 6 < 7m are ex- 
cluded, because they require stops which “pull” on the gimbals. On the 
other hand, the two roots in the interval t < 6 < 32/2 yield possible 
equilibrium positions. These roots exist only if H < 23(A — C)Q. Again 
the case (b) y” = H, = 0, yields no equilibrium positions of this type 
under the less restrictive condition H > 27'(A — C)Q. Since an increase 
in gyro angular momentum tends to degrade the transient performance 
of the system, we shall assume in the following that the gyro gimbals are 
limited in excursion by both yaw and pitch stops, so that only the re- 
striction H > 2'(A — C)Q need be satisfied. 

In the case of an unsymmetric satellite, a similar but more complicated 
analysis of the equilibrium positions can be carried out. 


4.4 Rod Extension and Satellite Despin 


We have already indicated the necessity of augmenting the satellite 
inertia to increase the gravity-gradient restoring torques to required 
levels. If this inertia augmentation is done after injection into orbit, it 
also reduces the satellite angular velocity to a level where the gravity- 
gradient torques may become effective in aligning the satellite with the 
local vertical. One method of inertia augmentation is the extension of so- 
called STEM rods described in detail in Ref. 8. These are beryllium 
copper tapes which form straight, tubular rods when unwound from a 
drum. If they are used together with dense tip masses, the satellite in- 
ertia may be increased by several hundredfold without a proportional 
increase in solar torque. In the following sections we first consider the 
effect a variable inertia has on the general form of the satellite equation 
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of motion and then discuss satellite despin using a single extensible rod 
in combination with two gyros. 


4.4.1 Equations of Motion for Variable Satellite Inertia 


We may derive all of the dynamical equations for the motion of a 
gravity-oriented body by integration of the general equations of motion 
for a continuous medium. In fact this is perhaps the most direct way of 
calculating the gravity-gradient torque, which is due to the variable 
gravitational body force acting on each mass element of the body. The 
resulting equation of motion, (9), applies to rigid and flexible bodies 
alike, provided that the angular momentum L is calculated correctly. 
L is given in general by the integral 


L = es xX (dr/dt + wo, X r)dm, (26) 


where r is the radius vector from the center of mass of the body B to the 
mass element dm. For a rigid body, r differs from its initial value ro only 
by a rotation and dr/dt = 0, yielding the usual form 


L= | eX (xX ndm = Les, 
B 
but in general r depends both on rp and 1, so that 
L =I-o, + / r X (dr/dt)dm, 
B 


where the inertia dyadic I depends on ¢. 

Let us now consider the extension of a single massless rod with tip mass 
m,. If a(t) is the radius vector from the center of mass of the satellite 
proper to the tip mass, (26) yields 


L =I1-o, + ma X 4, 
IX wo, = Ilo X ws + ma X (os X 4), 


m = mgms/(Ma + Ms), 


for satellite mass m, , and the inertia dyadic for the satellite around its 
center of mass 


ly = Aoi/i’ + Boj'j’ + Cok’k’. 


When the rod is erected parallel to itself, as would normally be the case, 
a X a = 0 and the effect of rod extension is entirely taken into account 
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by the time-dependent inertia I. If the rod is extended along the axis of 
minimum moment of inertia, a = k’a(t) and 


I = (Ao + ma’)i’i’ + (Bo + ma’)j’'j’ + Cok’k’. 


4.4.2 Satellite Despin 


When the moments of inertia of a torque-free spinning body are in- 
creased by a factor of N, conservation of angular momentum requires 
that the angular velocity of the body decrease by a factor of 1/N. On 
the other hand, if the spinning body contains a spinning rotor, an increase 
in the angular momentum of the rotor produces a corresponding decrease 
in the angular momentum of the body and hence a reduction in the 
angular velocity of the body. Both elements exist in the gravity-oriented, 
gyro-stabilized satellite. Inertia augmentation is required to obtain 
gravity-gradient torques of the proper level; rotation of the gyro gim- 
bals provides a change in angular momentum. 

A single extensible rod-tip mass combination provides adequate 
gravity-gradient torques, if erected along the satellite yaw axis. How- 
ever, erection of such a rod reduces only pitch and roll injection angular 
velocities; the yaw component is unaffected. This may be removed by 
using the gyros as reaction wheels. 

Suppose the gyros are caged at their null position during the rod erec- 
tion phase. Then, neglecting gravity-gradient and external torques 
during the short erection time, we have a freely spinning body. If the 
initial components of angular momentum are all of the order of magni- 
tude AoNQ, where A» is the moment of inertia about all three axes, we 
may reduce the pitch and roll components of angular velocity to order 
Q (1 rpo), with respect to inertial space, by extending a single rod that 
increases the pitch and roll moments of inertia from Ap to A = NAo. 
The yaw angular velocity remains equal to NQ. 

The yaw angular momentum, A,NQ, however, is of the order AQ, the 
same order of magnitude as the angular momentum H of each gyro. The 
gyros then are large enough to absorb the residual angular momentum. 
If the gyro gimbals are now released, the spin axes will tend to line up 
with the residual angular velocity around the yaw axis. One gyro spin 
axis rotates until constrained by the yaw axis gimbal stop; the other ro- 
tates until constrained by the pitch axis gimbal stop. There is thus a net 
change in yaw gyro angular momentum of order H and, furthermore, 
because of the rate-seeking property of the gyro, it always occurs in the 
correct sense to reduce the satellite angular momentum. 

This qualitative argument has been supported by computer runs for 
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given parameter values and initial rates. In particular one may determine 
the gyro size needed to despin from a given initial yaw rate. Actually it 
is not necessary to separate the erection and uncaging phases of injec- 
tion, so long as the gyros are not uncaged before erection. 

The result of such a computer run was given in Figs. 17-18. In one 
orbit the angular rates are reduced to less than 1 2 and capture occurs. 


4.5 Satellite Inversion 


As we have already noted, gravity-oriented bodies are bistable, i.e., 
they are in stable equilibrium with the axis of least inertia, on which the 
antenna would be mounted, both directed toward the earth and away 
from the earth. In this section we discuss a method of flipping the satellite 
by means of a simple ground command injected into the gyro gimbal 
torquers. 

When the satellite is in either of the above stable equilibrium positions, 
its total angular momentum is AQ + 2H cos ap around the pitch axis. 
If we could somehow rotate the two gyro gimbals instantaneously, so 
that both spin axes pointed along the pitch axis, the total angular mo- 
mentum would become A(Q + w) + 2H, where w is the pitch angular 
velocity with respect to the orbit frame. Since the gimbal rotation is 
assumed instantaneous, the total angular momentum is conserved, i.e., 


AQ + 2H cos a = A(Q+ w) 4+ 2H, 
or 
w = —2H(1 — cos a)/A. 


Thereafter, the single-axis, pitching motion is governed by an equation 
of the form 


Ag + 3(B — C)o’ sin ¢ cos g = 0, 


where ¢ is the pitch angle around the orbit pitch axis. A first integral of 
this equation is 


Ag’ + 3(B — C)Q’ sin’ g = Aw, 


since (0) = 0, (0) = w. In order that ¢ be one-signed, i.e., in order 
that tumbling occur, we must have 


Ao’ > 3(B — C)9? 
or 


(H/AQ)* > 3(B — C)/4A(1 — cos av)”. (27) 
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For a gyro angular momentum satisfying this condition we can excite a 
tumbling motion by collapsing the gyro spin axes toward the pitch axis. 
Actually, since we only want to rotate the satellite through a half revolu- 
tion, the gyro angular momentum need barely exceed this minimum 
value. Furthermore, we may collapse the gyro spin axes toward the 
pitch axis by simply reversing the bias torques applied to the gimbals for 
a suitable length of time. For a spindle satellite with (B — C)/A = 
0.99, computer runs (see Fig. 22) show that the satellite may be inverted 
by applying this reversed bias for about a half orbit. For a satellite with 
(B — C)/A = 0.4 it turns out that it is only necessary to remove the 
bias torques for a fraction of an orbit. For any satellite a suitable com- 
bination of bias torque and time can always be found to flip the satellite 
into its desired operating position, providing the relation (27) is satis- 
fied. 

We remark that bias torques could also be used to rotate the gyro gim- 
bals against the yaw stops. A similar, single-axis argument then gives an 
expression like (27). However, with the gyros back-to-back against yaw 
stops, the satellite has negligible yaw stiffness, and is vulnerable to yaw 
disturbances. This possibility of inversion was therefore not pursued. 
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APPENDIX 


List of Symbols 


A,B,C principal moments of inertia 

b = B/A,ce = C/A dimensionless principal inertias 

Cp gyro damping constant 

D = 1/2rT, damping rate, inversely proportional to set- 
tling time 

H, , He angular momentum vectors of gyros 1 and 2 

H =(|H,| = |H,| magnitude of gyro angular momentum 

H=H,+ H;, resultant gyro angular momentum vector 

Vg ores See oe orbital pitch, roll, and yaw components of 


total gyro momentum 


2764 
Hy ’ Ay ’ H,, 


h = (H/AQ) cos a 
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Optimum Reception of Binary 
Gaussian Signals 


By T. T. KADOTA 
(Manuscript received May 4, 1964) 


The problem of optimum reception of binary Gaussian signals is to 
specify, in terms of the received waveform, a scheme for deciding between 
two alternative covariance functions with minimum error probability. 
Although a considerable literature already exists on the problem, an opti- 
mum decision scheme has yet to appear which is both mathematically 
rigorous and convenient for physical application. In the context of a general 
treatment of the problem, this article presents such a solution. The optimum 
decision scheme obtained consists in comparing, with a predetermined 
threshold k, a quadratic form (of function space) in the received waveform 
x(t), namely, 


choose ro(s,t) af I x(s)h(s,t)x(t) ds dt < k, 


choose r(s,t) af I 2(s)h(s,t)x(t) ds dt = k, 


where r(s,t) and 1:(s,t) are the covariance functions while h(s,t) is given 
as a solution of the integral equation, 


I ro(s,u)h(uv)ri(v,t) du dv = (st) — ro(s,t). 


This may be regarded as a generalization of the “correlation detection” in 
the case of binary sure signals in noise. 

Section I defines the problem, reviews the literature, and, together with 
certain pertinent remarks, summarizes principal results. A detailed mathe- 
matical treatment follows in Section II and the Appendices. 


I. INTRODUCTION AND SUMMARY 


1.1 Definition and Nature of Problem 


The problem of optimum reception of binary Gaussian signals arises 
as a mathematical idealization of a common communication problem. 
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Consider a radio communication link containing a random medium. 
The transmitter sends one of two possible signals with known frequency 
rates (a priori probabilities), and the receiver decides which one of the 
two has been transmitted. Even if the transmitted signals are deter- 
ministic, the observable waveforms at the receiver appear to be random 
owing to effects of the random medium and the ever-present thermal 
noise at the receiver. The task of the so-called optimum (or ideal) 
receiver is to decide, upon observation of the received waveform for a 
finite time, which one of the two signals has been transmitted in such a 
way as to minimize the so-called probability of error. Thus, the problem 
of optimum reception amounts to specifying in terms of the received 
waveform such an optimum decision scheme for given a priori proba- 
bilities. 

It is assumed that the values of the received waveforms at arbitrary 
instants of time during the observation interval, say 0 < ¢ S 1, are 
jointly Gaussian distributed with means zero and a covariance matrix 
which is determined by either one of two known covariance functions, 
depending upon which one of the two signals is transmitted. Then, the 
above problem may be stated as one of testing simple hypotheses as 
follows: Suppose there are two ensembles of real functions of time ¢, 
0 st S 1, which are statistically characterized as being Gaussian dis- 
tributed with identically vanishing mean functions and two distinct 
covariance functions. A sample (function) x(t) is drawn either from 
the first ensemble with probability a (the null hypothesis: Ho) or from 
the second with probability 1 — a (the alternative hypothesis: H,). 
Determine a ‘critical region” A, (a subset of a space of real functions 
Q) for rejecting Ho (or accepting H,) if x(¢) belongs to A, and accepting 
Hy if x(t) does not, in such a way that the associated error probability, 


P.( Aa) = aPo(Aa) + (1 — a) Pi(Q — Aa), CL) 


is no greater than P,(A) for an arbitrary A C Q; where Py and P; are 
two Gaussian (probability) measures defined on (measurable) subsets 
of @ by the two zero mean functions and two covariance functions. 
Thus, the problem of optimum reception amounts to dividing the func- 
tion space into two parts in such a way that the weighted probabilities 
on them specified by (1) are minimum among all possible divisions. 
There are two features worth noting in this formulation. One is the 
lack of uniqueness of the optimum division as a consequence of adopting 
the minimum error probability as the optimality criterion. Namely, it 
is immaterial whether a certain set N (of functions) with both proba- 
bilities zero, i.e., Po. N) = 0 = P,(N), should be a part of A, or 2 — 
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A. , since it does not contribute to the error probability P,. Thus, those 
sets upon which Pp and P, vanish can effectively be ignored. The other 
feature is a stipulation that the division be specified in terms of the 
general sample (function), namely, the general element w of the func- 
tion space Q, so that each sample (a received waveform) can be classi- 
fied as a member of A, or Q — Aq. From the probability theoretical 
point of view, these features dictate specification of the division (or the 
decision scheme) to be made in terms of the “almost all sample func- 
tions” (or “almost surely,” “with probability one,’ etc.) proposition. 
While this offers flexibility in one sense, it presents a restriction in 
another. For example, anticipating the forthcoming results, if the 
division of 2 is made by means of a certain w function on Q, this function 
can be arbitrary or even undefined on the sets of w upon which Po» and 
P, vanish. Yet, if the function is defined as a certain limit (or, obtained 
by a limit operation), then the sense of convergence must be at least 
“for almost all sample functions,” but not ‘in quadratic mean (in the 
mean),” “in probability,” and ‘“‘in distribution,” which are in general 
weaker. 

The problem of optimum reception of binary Gaussian signals may 
be regarded as a generalization of an almost classical problem in com- 
munication theory, namely, optimum detection of binary sure signals 
in Gaussian noise. It is well known that such detection consists in com- 
paring, with a preassigned threshold, the correlation integral of the 
received waveform and a certain function determined by the two signals 
and noise characteristics. More precisely, let {x,, 0 S$ ¢ S 1} bea 
Gaussian process whose covariance function is r(s,t), 0 S$ s,t S 1, con- 
tinuous and positive-definite, and whose mean function is either mo(t) 
or m,(t), both continuous, corresponding to the two sure signals. Denote 
the sample function of the process by x(t) and the threshold by c > 0. 
Then Grenander’ shows that if the integral equation 


I fsDiGy asa) (2) 


has a square-integrable solution, the optimum decision scheme under the 
Neyman-Pearson criterion is the following: 


choose mo(t) if ig x(t)g(t) dt < ¢, 
é (3) 
choose m(t) if I x(t)g(t) dt = «. 


Suppose the two sure signals in the above problem are replaced by two 
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stochastic (Gaussian) signals and the additive noise is included in these 
signals so that the decision between two sure signals becomes now the 
decision between two Gaussian signals. Furthermore, suppose the 
optimality criterion is changed from the Neyman-Pearson’s to the 
error-probability minimization. Then, the problem becomes optimum 
reception of Gaussian signals under the minimum error-probability 
criterion. More precisely, let {x,;, 0 < t S 1} be a Gaussian process 
whose mean function is identically zero and whose covariance function 
is either ro(s,t) or ri(s,é), continuous and positive-definite, with the 
accompanying a priori probabilities a and 1 — a respectively. Then 
what are the counterparts of (2) and (3)? That is, under what condi- 
tions can the optimum decision scheme be specified in terms of a corre- 
lation integral involving the sample function, and what is the decision 
scheme itself? 


1.2 Review of Literature 


Despite momentous foundations laid by Grenander in 1950, little 
progress was made toward rigorous solution of the above problem during 
the succeeding decade, due primarily to restrictions of the mathematical 
scope to elementary probability theory. The majority of the work is 
characterized by two features: (7) use (and misuse) of the classical 
method of likelihood ratio and (77) attempts to specify the decision 
scheme in terms of some integrals involving the sample function. In 
order to use the classical method, however, the continuous (parameter) 
process must first be “represented” by a (finite) sequence of random 
variables. Thus Middleton’ and Price * sample {x;,0 S ¢ S 1} to obtain 
the representing sequence z;,,--:, v;, and form their likelihood ratio 
bas: 


ln(@ 4 ee ,,) - (Re Rn) |? 
exp iB > yD [(Ro”) 7 — (Ri eeu} 
wl = 


where Ry” and R,” are two alternative covariance matrices of see 

-, %, given respectively by (Ro) i; = ro(t;,t;) and (Ri); = 
ri(t; ,t;); 7,7 = 1, ---,n. Then, as n — © and each sampling interval 
becomes infinitesimal, the decision scheme is specified in terms of the 
limits of the exponent and the factor before the exponential in (4), 
provided these limits exist. Middleton argues on a formal basis that the 
exponent of (4) becomes an integral 


(4) 
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1 
i LY t dt, 
0 


where the new process {y;, 0 S ¢ S 1} is given as one of the solutions 
of a pair of certain simultaneous ‘stochastic integral equations.”’ Price 
also formally argues that the exponent converges to an integral 


1 1 
[ [ ealou(s0 = go(s0 le ds at, 


where go and g; are given as solutions of a certain pair of ordinary in- 
tegral equations. 

Davis,’ Bello’ and Turin,® on the other hand, make orthonormal 
expansions of the process and use the Fourier coefficients as the repre- 
senting sequence. However, the formulation of Davis and Bello is based 
upon a ratio of probability density functions of two sequences of Jourier 
coefficients corresponding to two separate orthonormal expansions, 
which is not a likelihood ratio; while the fundamental notion in Turin’s 
formulation is “probability density functions of processes,’’? which are 
unbounded functions in general. 

One difficulty common among all the papers is the total absence of 
convergence proofs for series of random variables. As mentioned in 
Section 1.1, the sense of convergence must be ‘for almost all sample 
functions.”’ Yet, for example, it is not clear on what ground the ex- 
ponent of (4) should converge for almost all sample functions to those 
stochastic integrals, nor is the existence of the integrals themselves 
shown. 

The other common difficulty, of a more fundamental nature, is the 
lack of optimality proofs. Considering the process as an ensemble of 
‘well behaved” functions of time, it is intuitively plausible that such 
an ensemble should be “adequately” described by the distributions of 
the “infinitely densely’”’ sampled values of the member functions or by 
the distributions of the Fourier coefficients of some orthogonal expan- 
sions in £2 (the space of square-integrable functions). Namely, the 
continuous (parameter) process should somehow be “representable” by 
a sequence (infinite in general) of random variables. However, the 
optimality of the resultant decision scheme should in general be affected 
by selection of the representing sequence. Obviously, there are in- 
numerable ways of sampling the process, resulting in innumerable 
decision schemes. Similarly, there are as many sequences of Fourier 
coefficients to represent the process as orthonormal bases of £2. Yet, 
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should all the representing sequences eventually yield the decision 
schemes with the same error probability, the minimum? If not, which 
sequences are the best representations? Even if the best sequence is 
chosen, on what grounds will the error probability remain minimum in 
the limit as n — ©, since, after all, the classical method is valid only 
for a finite n? 

Note that there is no a priori need for the use of either likelihood 
ratios or representations, so long as the proposed decision scheme is 
shown to have the minimum error probability. In fact, Slepian’ shows 
interesting special examples (of the singular case) where minimality 
of the error probability is explicitly proved. From a different point of 
view, Parzen recently restores Grenander’s basic formulation, where 
what is called the Radon-Nikodym derivative plays the role of the 
likelihood ratio in the classical theory, and puts the sampling method. 
on a more rigorous basis. 


1.3 Summary of Main Results and Remarks 


Solution of the problem of optimum reception stated in Section 1.1 
rests on the following two fundamental (measure theoretical) facts: 

(a) If Po and P; are two Gaussian (probability) measures, they 
must be either (2) “equivalent,” i.e., Po = Pi, or (at) “orthogonal” 
(or “singular’’), i.e., Po 1 Pi. 

(b) If Po and P; are equivalent, there exists a certain nonnegative 
random variable f(w), called the Radon-Nikodym derivative of P, 
with respect to Po , and a set of w points inQ such that f(w) = a/(1 — a) 
can be taken as the desired critical region, denoted by A, in Section 1.1. 
On the other hand, if Po and P; are orthogonal, there exists a set H 
of w points in 2 such that Po(H) = 0 and Pi(H) = 1, and the critical 
region can be taken to be such a set H. In short, the following set S, 
serves as the. critical region: 


{f(w) 2e/(1—a)} if Po=P,, 
Sq = | (5) 
H if Po L Pi. 
Thus, the problem of determining the critical region now becomes 
the problem of finding such a random variable and a set H. 
Next, through the use of theory of martingales, the following facts 
can be established: 


For almost all sample functions, 
(2) if (and only if) 
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intr (Re) Ri St = RR Se, (6) 


Nn>wo 


then 
lim 1,(@1,, °** > %s,) = f(w) under both hypotheses;* (7) 


(17) if (and only if) (6) is not satisfied, then 


{O under null hypothesis, 
ine et a (8) 
n> co under alternative hypothesis, 


provided that the sequence {¢;} is dense in the interval 0 S ¢ S 1, where 
“tr” stands for ‘trace’ and the likelihood ratio J, , together with 
Ro” and R,°”, is previously defined in (4).t 

Examination of (7) and (8) in conjunction with (5) immediately 
leads to the conclusion that, irrespective of the hypotheses, 


S. = {lim la(%i,,°°° , 22,) 2 a/(1 — @)}. (9) 
Thus, if x(t), --- , v(t,) are the values of the sample function (the 


received waveform) x(t), 0 S$ ¢ S 1, sampled arbitrarily but with the 
restriction that each sampling interval becomes infinitesimal as n —> «, 
then the optimum decision scheme becomes the following: 


choose ro(s,t) if liml,[v(t1),--- ,2(tr)] <a/(1 — a), 


1 

choose ri(s,t) if liml,[a(t,),--- ,v(tn)] 2 a/(1 — @). 
Furthermore, according to (2), if the given covariance functions 7o(s,¢) 
and r,(s,t) are such that (6) is satisfied by their covariance matrices 
Ro” and Rk,” obtained through sampling, then, regardless of whether 
ro(s,t) or r1(s,t) is the true covariance function, the above limit is finite 
for almost all sample functions, and the error probability associated 
with the decision scheme (10) is minimum. According to (77), on the 
other hand, if ro(s,t) and r,(s,t) are such that Ro” and Ry” do not 
satisfy (6), then for almost all sample functions the limit vanishes 
if ro(s,t) is true, while the limit diverges if ri(s,t) 1s true; and, inde- 
pendent of the given a priori probabilities, the associated error proba- 

bility simply vanishes, resulting in the case of ‘“‘perfect reception.”’ 
* Recall that the null hypothesis is the hypothesis that ro(s,t) is the true 
covariance function of the process while the alternative is the hypothesis that 


ri(s,é) is the true covariance function. 
t (6) and (7) are also found in Parzen.$ 
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It should be noted, first of all, that the sequence of sampled values is 
not used to represent the continuous process but to obtain the crucial 
random variable f and set H through formation of the likelihood ratio. 
Secondly, under the assumption of the covariance functions being 
continuous, it can be proved that, regardless of the sampling manner, 
the limit of the likelihood ratio satisfies either (7) or (8), thus yielding 
the same error probability, so long as each sampling interval becomes 
infinitesimal as n > .”* Lastly, negation of condition (6) can be re- 
garded as a necessary and sufficient condition for perfect reception. 

Having obtained the optimum decision scheme (10), the question 
of possible simplification naturally arises next. Examination of the 
form of the likelihood ratio (4) suggests that, if the limits of the ex- 
ponent and the factor before the exponential exist separately, decision 
scheme (10) may be rewritten in terms of these limits. Such an attempt 
already appears in the literature, as mentioned in Section 1.2. However, 
the crucial mathematical consideration hinges upon the condition under 
which such a procedure can be justified. Here, the following condi- 
tion is shown to be necessary and sufficient: 

lim tr [(Ro”) 7 Ri” — I] < © 


N>O 


lim tr [ito Gea) — DT) <= m. 


N>O 


(11) 


Note that this condition implies (6), as it should, and excludes the 
case of perfect reception. In fact, condition (11) states not only that 
the sum of two traces converge as condition (6) requires, but also that 
the two traces converge individually. In conclusion: If condition (11) 
is satisfied, then there exist a positive constant 8 and a random vari- 
able @ such that 


6B = lim | Ro” (Ri) |, . (12) 


n>o 


n 


¢=lim > DD (Ro) — (Ri) eg 0,8; (13) 


n>o i=l j=1 


for almost all sample functions under both hypotheses; and the optimum 
decision scheme (10) is reduced to the following: 


choose 7(s,t) if 6(x) < log (1/8)[a/(1 — a)’, 

choose 7,(s,t) if @(7) 2 log (1/8)[e/(1 — a)}’, 
* This does not imply that two different decision schemes yield the same de- 
cision for every sample function; rather, a set N of sample functions, for which 


two decisions differ, give no contribution to the error probability, i.e., Po(V) = 
0 = P,(N). 


cla) 
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where 6(2) is the value of @ for the sample function x(t), which is ob- 
tained by simply replacing x;, and x;,; in (13) by x(¢;) and x(t;). 

Although the above decision scheme is certainly a step toward sim- 
plification compared with (10), it is still inconvenient, if not unfeasible, 
for physical application, since it requires limit operations for each 
received waveform. Yet, so long as the likelihood ratio is formed in 
terms of the sampled values, elimination of the limit operation appears 
to be impossible. Recall, however, the problem of optimum detection 
of sure signals in noise mentioned in Section 1.1. There, the likelihood 
ratio is formed in terms of the Fourier coefficients of the so-called 
Karhunen-Loéve expansion of the process instead, thus resulting in the 
decision scheme specified in terms of an integral in place of an infinite 
series, as shown by (3). Needless to say, in the present problem where 
there are two covariance functions instead of one, additional mathe- 
matical complications should be inevitable. Nevertheless, an optimum 
decision scheme which is essentially comparable to (3) can be obtained, 
as will now be shown. 

Let \y 2 Xx = -:-- and y(t), Y(t), --- be the eigenvalues and the 
orthonormal eigenfunctions associated with the covariance function 
ro(s,t), and, similarly, let uw. 2 uw. 2 --- and gi(t), g(t), --- be those 
associated with 7,(s,t). Then, it can be shown that, under the assump- 
tion of ro(s,t) and 7,(s,t) being continuous and positive-definite, the 
integrals 


1 
f= | vwi(t) dt, 4=1,2,-::, (15) 
0 


exist for almost all sample functions under both hypotheses, and are 

Gaussian distributed with means zero. Furthermore, the covariance 

matrix determining the joint distribution of & , --- , &: 1s given by either 
(Qo )iz = ALbiz, OF 


(n) = 7 (16) 
(Q.”)ig = Giz = p> MeUniUes, = Wag = [ gi(t)y;(t) dt, 


depending upon which one of ro(s,t) and r;(s,t) is the true covariance 
function of the process. 


Thus the likelihood ratio of & , --- , &, becomes 
; n n)\—1 14 Ise n)\— n)\— 
b= | Qo (Qi) |} exp ‘3 dy 2, (Qo) * — (Qs) a (17) 
i=1 j= 


which corresponds to (4). It turns out that, under the previous assump- 
tion on the covariance functions, there is a complete parallel between the 
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two formulations, one based upon 2;,, --:, 2, and the other based 
upon &, ---, & . Thus, for almost all sample functions, 


(7) if (and only if) 
lim tr [(Qo”)~* Q” — 27 — Qo” (Q”)7] < », (18) 


then 
lim 1,(&, «+, &) = f(w) under both hypotheses; (19) 


(iz) if (and only if) (18) is not satisfied, then 


ae 0 under null hypothesis, 
lim In( i pie ety En) = (20) 


co under alternative hypothesis. 
Then, the optimum decision scheme corresponding to (10) becomes: 


choose ro(s,t) if lim dn[&(x),--+ , &(x)] < a/(1— 2), 


(21) 
choose 7(s,t) if lim /,[&(v),--- ,&(v)] 2 a/(1 — a), 


where £;(z), 7 = 1, ---, n, are the values of the random variables 
£, for the sample function x(t), namely, 


£;(2) -{ x(t)yi(t) dt. 


Again, note first the role of {£,;}, which is not a representing sequence 
of the process but a means for obtaining the crucial random variable f 
and set H by forming the likelihood ratio. Secondly, it can be shown 
that, under the assumption of the two covariance functions being con- 
tinuous and positive-definite, {¢,;(t)} can be used in place of {y,(t)} 
to form {&,}, but not any orthonormal basis of &.. Lastly, as before, 
negation of (18) can be interpreted as a necessary and sufficient condi- 
tion for perfect reception. Completing the parallel, if (and only if) 

lim tr [(Qo”) 7 Qi” — I] < &, 


N>o 


22 
lim tr [Qo (Qi) — I] .< &, aa 


then there exist 8 and @ such that 
B = lim | Qo” Ca ae (23) 


RECEPTION OF BINARY GAUSSIAN SIGNALS 2777 


6 = lim Qu 2 [(Q0 7 = (QU Yg BE; (24) 


for almost all sample functions under both hypotheses; and decision 
scheme (21) is reduced to the following: 


choose ro(s,t) if 6(x) < log (1/8)[a/(1 — a)’, 


ae i ; (25) 
choose r(s,f) if 6(%) 2 log (1/8)[a/(1 — a)f. 


Returning to the original goal of eliminating the limit operation, 


examination of (24) immediately suggests the possibility of rewriting 
@as a quadratic form in x,. That is, if one defines 


(at) = DY bss Wal, (28) 
where 


his” = (CQ) — (Qi) Va, 
then, from (15), 


1 1 
é= lim | { ah (8,t)2a, ds dt, (27) 
N—>O 0 0 


and hij”; i,j = 1, ---, , can be given as a solution of the matrix 
equation 


Qo” (his) Qi” = Qi” —— Qo”, 


or, more directly, h‘”(s,t) can be given as a solution of the integral 
equation 


1 1 
l ro” (su)h™ (uo) (v,t) dudv = (st) — ro (st), (28) 
0 0 
where 


niP(st) = Del, nM) = DX ails). (29) 


Then, the following conjecture should be imminent: 


1 J 
= i / xh(s,t)x, ds dt, (30) 
0 0 


where h(s,t) is a solution of 
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[ [ ro(s,u)hCup)iri(v,t) du dv = 1(s,t) — 70(s,t), (31) 


which are formally the limits of (27) and (28) respectively. The essen- 
tial part of the above conjecture can be shown to be correct. That is, 
if (31) has a solution h(s,t) such that 


1 1 
/ [ h’(s,t) dsdt < ©, then 
0 “0 
i pt on ow (32) 
i i ah(s,t)a,ds dt = lim >> >> hi; &:&; 
0 0 


n-o i=l j=1 


for almost all sample functions under both hypotheses, provided that, 
for all 7,7 = 1, 2, ---, 








ai <1, an > Dd’ fas; |, < K, (33) 
j=1 


1— Do d% — ay | 
k=1 
where K is a positive constant independent of 7 and j. 


Then the optimum decision scheme (25) is immediately reduced to the 
following desired form: 


1 1 1 ay 2 
choose 7(s,t) if [| 2(s)nisda(t) as at H( , 

0 0 B\l—a 

- , (34) 
° if [ f c(syntsde() asia lo 4(-£,) 
choose 1i(s,t) i td ts isdt = Ba ery 


A 
— 
3 
08 





IV 


Difficulty of the proof lies mainly in the fact that, as n increases, the 
coefficients h;;‘” themselves vary with n as well as the number of the 
terms of the sum, yet h‘”’(s,t) must approach A(s,t) in such a way that 


1 1 1 1 
: (n) = 
lim [ i ah” (s,t)a,ds dt = i [ x h(s,t)a, ds dt 


for almost all sample functions under both hypotheses. This accounts 
for need of the auxiliary conditions (33). The first condition is not a 
restriction in physical application since 


oe) fo) 1 
San = mi = i ri(t,t) dt 
i=l j=1 0 


is the average energy of the waveform in the interval 0 S$ ¢ S 1, which 
can always be normalized to assure a;; < 1. Although the remaining two 
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conditions are restrictive, current knowledge of infinite systems of 
equations does not seem to allow their removal. Thus this calls for a 
future investigation of the degree of restriction imposed by them in 
physicial application. 

As anticipated, there is an apparent correspondence between the 
classical case of sure signals in noise mentioned in Section 1.1 and the 
present case of stochastic signals, namely, between (2)—(3) and (31)—(834), 
except for the fact that the constituent functions in the latter case are 
functions of two variables instead of one. As the integral of the decision 
scheme (3) has a simple physical interpretation (the output of a linear 
filter with g(¢) as its impulse response), so does the integral in (84). 
Namely, it is the output of a quadratic filter whose impulse response is 
h(s,t). The advantage of this scheme over the others — namely, (10, 
(14), (21) and (25) — is obvious. Given two covariance functions, the 
impulse response of the filter is uniquely determined by the integral 
equation (81) if a solution exists, and decision is made by comparing, 
with a preassigned threshold, the appropriately sampled output of the 
filter with the received waveform as its input, instead of having to per- 
form the limit operation for each received waveform. 

Finally, it should be remarked that the optimum decision scheme 
above differs formally from those previously obtained by others.* A 
further, and more significant, distinction lies in the assured optimality 
of this scheme, inherent in its derivation, while the optimality of the 
others has yet to be proved separately. Tt 


Il. MATHEMATICAL THEORY 


2.1 Gaussian Processes 


Let {x:, ¢ € T} be a real Gaussian process with a parameter set 
T = [0,1] and a finite dimensional distribution function F’,,,...,,,., which 
is determined by given mean function and covariance function where 
ty, -+*, t, are an arbitrary finite subset of 7’. It is assumed that the 
mean function is identically zero on T while the covariance function 
is positive-definite and continuous on 7’ X T’. In the present problem 
it is desirable to have an explicit representation of the given process 
{v,,¢ € T} on a function space.f 

Let Q be a space of real-valued functions of ¢ € 7’. Let 2,(w) be the 

* Although their work is briefly reviewed in Section 1.2, their decision schemes 
are not stated explicitly in this paper. 

¢ This excludes Parzen’s®’ case where the decision scheme is essentially (10). 


t The next paragraph follows closely Example 2.3 in Supplement, Doob,’ pp. 
609-610. 
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w function with the value ¢(s) if w is the function ¢(-), so that x.(w) = 
¢(s). If the ¢ function w has values ((f), --:, €(f,) at th, ---, ha, 
the condition 


€(h) Spi; Yes CEs) < Pn 


defines an w set, which is denoted by 


{r1;(w) S p:, t= 1s n} (35) 
where pi, -** , px are arbitrary real numbers. Next, let § be the class 
of all w sets obtained in this way for arbitrary n, t;, --- , tr, and let 


@r be the Borel field generated by $, and lastly let P be a probability 
measure defined on the sets of @z whose value is given by 


P{x1,(w) = pis = 1, Pe Pag n} = | er aeeree (pi, eas > Pn)» (36) 


Then, {x:(w), £ € T} isa representation of the given process {x,, § € T} 
on the function space Q, and (Q, @7, P) is the explicit probability meas- 
ure space for the representation. 

(Remark) By virtue of the choice of representation space, the general 
elements of the space © coincide with the general sample functions of 
the process {x,(w), ¢ € T}. Thus, the phrases, ‘almost everywhere (or 
almost surely)” and “for almost all sample functions,” have the same 
meaning. 

The assumption of continuous covariance function has the following 
significant consequences: 

(1) {x.(w), t € T} has an equivalent (with respect to P) separable 
and measurable process on the same w space.t Hence, so long as the 
almost-everywhere valid properties of a given process are of interest, 
as in the case of this paper, the given process may as well be taken 
to be separable and measurable. Therefore, the Gaussian process 
{x,(w), ¢ € T} is henceforth assumed to be separable and measurable. 

(it) {a.(w), t € T} is sample (Lebesgue) square-integrable on T 
almost everywhere with respect to P.t{ 

This immediately implies that a Lebesgue integral 


* Symbolic distinction between the given process and its representation on the 
function space is made by explicitly writing the argument w for the latter. 

t Note continuity of the covariance function of a process is equivalent to con- 
tinuity in quadratic mean of the process (Loéve,!° p. 470), and hence it implies 
continuity in probability of the process. Then, according to Theorem 2.6 in Doob,? 
pp. 61-62, there exists an equivalent separable and measurable process on the 
same space. 

t See Loéve,'® pp. 520-521. 
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eo) = f elo at 


exists almost everywhere, in which y(t) is any continuous function on T. 
Furthermore, since the sample Lebesgue integral of a process coincides 
almost everywhere with the Riemann integral in quadratic mean cri- 
terion,” and also the Riemann integral in quadratic mean criterion of a 
Gaussian process is a Gaussian (random) variable,t £(w) is a Gaussian 
variable. 


2.2 Formulation of Problem 


Let Fo,e,,---,4, and F1;1,,...,4, be two alternative Gaussian finite di- 
mensional distribution functions of a real separable and measurable 
process {x:(w), £ € T}, whose mean functions are identically zero and 
whose covariance functions, denoted respectively by 70(s,t) and ri(s,t), 
are positive-definite and continuous on T X T’. Let Po and P; be the 
Gaussian probability measures defined respectively by F,1,,...,2, and 
F\; 1,,..-,4, 0n the Borel field @,r of subsets of Q as defined previously. 
It is well known that Po and P; are either equivalent, Po = P,, or 
orthogonal, Po 1 P,.t 

Define a set function P, by 


PCA) — aPo(A) + (1 = a) P}(Q = A), A € Br, (37) 
where a is a constant, 0 < a < 1.§ Let A. € @r be such a set that 
PAE) So nk) for all A € @r. (38) 


Then, the problem of interest is to specify such a set Aq in terms of 
r1(w).|| 

Now, if Po = Pi, let f(w) be a Radon-Nikodym derivative of P, 
with respect to Po ; while, if Po L Pi, let H € @r be a set such that 
P\(H) = 0 and Pi(H) = 1. Then, it can be shown that the following 


* Henceforth, the ‘‘sample Lebesgue integral of a process”’ will simply be called 
the ‘‘integral of a process,’’ unless otherwise specified. A definition of Riemann 
integral in quadratic mean criterion is in Loéve,!® pp. 471-474. 

+t See Loéve,! p. 485. 

t See Hajek.1!:2 

§ P. is the so-called error probability. Although 0 < P. S$ 1 for all A € @7, 
P.is not a probability measure, and its full meaning is given in Section 1.1. 

Equivalence between this problem and that of ‘‘optimum reception of binary 
Gaussian processes’”’ is discussed in detail in Section I. 
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set S, satisfies condition (38) :* 
ifPo= Pi, Sa= {f(w) 2 a/(1 — a}, 
Py: hs Py Dati: 


Thus the above stated problem is reduced to that of finding 

(7), if Po = Pi, a function of x:(w) equal to f(w) almost everywhere 
with respect to Po and P;, and 

(iv), if Po 1 P:, some such set H expressible in terms of x ;(w). 


(39) 


2.3 Solutions — I 


2.3.1 General Solution 


Let {7,} be a sequence of points in 7’ = [0,1], which is dense in T. 
Let ®, be a Borel field generated by a class of w sets of the form 


{x,,(w) = Pi, t= , ee me (40) 
and Ict 8, be the minimal Borel field containing U @,, . Obviously, 
n=1 


GC BC: CB, CBr. (41) 


Then, since {x:(w), ¢ € T} is a separable process, continuous in proba- 
bility (with respect to Po and P,), and the sequence {7;} is dense in 7’, 
it follows that, for an arbitrary set A € @7, there exists a set A’ € @, 
such that 


P)( AAA’) = 0 = P,( AAA’). (42)t 


Now, through the use of the covariance functions 7o(s,t) and r1(s,t) 
and the fact that the mean functions are identically zero, the density 


functions po and p, of the random variables 2,,(w), 7 = 1, --:, 7, 
corresponding to Py and P; respectively, are obtained as follows: 
Dnlr, vee, Yn) = (Qr)"” | R.’” ee 
noon (43) 
x exp \-} oy oe KG eas aa vo, m= 0, Ms 
ti=1 7=1 
where the 7; ,7 = 1, --- , n, are a finite subset of {7x}, and R,”,m = 0. 


1, aren X n symmetric, positive-definite matrices defined by 
Caraga = Taltes Ti) 5 m= 0, L; 4,9 = 1, vet yg N. (44) 
* See Appendix A. The first assertion of (39) follows from Corollary 1 in this 


appendix, while the second assertion is self-evident. 
t See Doob,® pp. 51-55; in particular, Theorem 2.2 (7). 
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Then define a random variable l,(w) by 


1 
2 


Lilo) = pilx,,(w), -++, t,(w)] = | Ry (R”)7 
a Doltr,(w), -+, tr,(w)] | ( ) 








(45) 
1 noon te ne 
X exp ‘5 > 2 (Ro) — (Ri) ‘hira(w)2n(o)}, 
t=1 j= 
Note that 
ln(w) 2 O forall n. (46) 


Furthermore, since R,,“”, m = 0, 1, are positive-definite, p, = 0 when- 
ever ~o = O and vice versa. Then, it can be shown that the processes 
{In(w), n = 1} and {1/l,(w), m = 1} are martingales with respect to 


Po and P, respectively .* 
(7) Po = Py: Let Eo{f(w)| @,}, n = 1, 2, --+, be a conditional 
expectation of f(w), given ®, , with respect to Py. Namely, 


f Bots) | Ga} dPy = [ fle) dPy for amy AE Gn. 
Then, 
In(w) = Eo{f(w)| Bn}, — a.e. (Po), T (47) 
and, from (41)t 
ie Eo{f(w)| Bn} = Eolf(w)|@,,}, ae. (Po). (48) 


Yet, from the definition of Ho{f(w)| @,} and (42), 
Egtf(w)| @.} = f(w), — a.e. (Po). (49) 
Hence, 


lim ln(w) = f(w), a.e. (Po). (50) 


Since Py) = P, the above implies 


lim ln,(w) = f(w), a.e. (P;). (51) 


Thus, the desired function, which is equal to f(w), a.e. (Po, P:), is 


* See Doob,® pp. 91-98. 

T ‘‘a.e. (P»),’’? m = 0, 1, is used as a shorthand notation of ‘‘almost everywhere 
with respect to Pm .’”’ Similarly, ‘‘a.e. (Po , Pi)”’ will be used to denote ‘‘almost 
everywhere with respect to both Po and Pi.” 

ft See Doob,? p. 331. 
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L,,(w), which is defined by 
L(w) = lim I,(w). (52) 


(it) Po L Py : From (46), lim I,(w) < ©, ae. (Po).” In fact, it can 
be shown that ass 


lim 1,(w) = 0, a.e. (Po).T (53 ) 


By using the same argument, it follows that 


lim [1//,(w)] = 0, a.e. (P;). (54) 


Hence, for an arbitrary constant c > 0, 


Poflim 1,(@) 2 c} = 0, P,f{lim l,(w) 2 c} = 1. 


Thus, the desired set H, with Po( 1) = 0 and P(A) = 1, is 
H = {liml,(w) 2 a/(1 — a)}. (55) 


In summary, upon combination of (52) and (55) in conjunction with 
(39), the desired set S, is 


S. = {lim l,(w) 2 a/(1 — a)}, (56) 
irrespective of whether Py = P, or Po L Pi. 


2.3.2 Special Solutions and Summary 


Under certain restrictive conditions, the set Sa can be specified in 
terms of well defined functions of x;(w). It is the purpose of this sub- 
section to obtain such specifications as well as the accompanying con- 
ditions in terms of the given covariance functions ro(s,t) and r;(s,t). 

(2) If Po = Pi, it has already been shown that 


Sa = {l,(w) 2 a/(1 — a)}. 


Thus, it is of interest to obtain a condition for Po = P, .t 
Define 


1n(w) ad [1,(w) i 1] log In(w), = uF 2, nce (57) 


* See Doob,® p. 319; Theorem 4.1 (2). 

+ See Doob,® pp. 345-346. 

t Such conditions are already available (e.g., Parzen,’ Shepp!*). For more de- 
tail, see Yaglom." 


RECEPTION OF BINARY GAUSSIAN SIGNALS 2785 


Then, since (p — 1) log p, p > 0, is a real, continuous and convex 
function of p and Mo{|(In(w) — 1) log li(w)|} < ~,n = 1,2, -:-; 
{nn(w), m 2 1} constitute a semi-martingale (with respect to Po).* 
Hence, Ho{yn.(w)}, n = 1, 2, ---, forms a monotone nondecreasing 
sequence 


Eo{m(w)} S Hofne(w)} S++: , (58) t 
which must either converge or diverge. Then, according to (53), 
if Po tL P,, then 
lim Eo{nn(w)} = ©. (59) 


Hence, since Py and P, can be either equivalent or orthogonal, it follows 
that 


Po S P; ) if 
lim Eo{mn(w)} < ©. (60) 


It can be shown that the converse of (59) is also true,f i.e., 


if lim Fol nn(w)} = 0, then Po na Py . (61) 


This implies that the condition of (60) is also necessary. Thus, through 
substitution of (45) into (57) and application of (43) for expectation 
calculation, § 


Po = P,, if and only if 
lim tr [(Ro) 7 Ri” — 22 + Ro” (R”)"] < ~. ~~ (62)| 


where Ro” and R,'” are defined in terms of ro(s,) and r,(s,t) by (44), 
(it) Examination of (45), (50) and (51) indicates that, in addition to 
condition (62), if 


lim | Ro” (Ri) "| = 8, O<B<-@, (63) 


then 


* See Doob,® pp. 295-296, Theorem 1.1 (iii). ‘Zo’? denotes expectation with 
respect to Po , namely, an integration over © with respect to Po . 

t See Doob,?® p. 324, Theorem 4.1s. 

It See Hajek;!? in particular, Lemma 2.1. 

§ For this calculation, use the following equality: Ho{y,.@)} = Eiflog l,(@)} — 
Eollog ln@)}, m = 1, 2, ---. 

| ‘tr’? denotes “trace,” and J is the n X n identity matrix. 
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n 


lim >) Dy [(Ro”)* — (Ri) Jaz 2,(@)a1,(@) < 0, 
noo t=1 g=1 (64) 
a.e. (Po, P1). 
Thus, by defining 6(w) as the above limit, i.e., 
A(w) = lim DY DY [(Ro”) 7 ~ (Ri) Jats, (w)ar,(w), (65) 


n>o t=1 s=1 


the set Sa € @r can be specified as follows: 


Sa = {0(w) = log (1/8) (a/(1 — @))*}. (66) 


It will now be shown that two conditions (62) and (63), required 
for the above specification of S,, are equivalent to the following pair 
of conditions: 


lim tr [(Ro”) 7? Ri” — I] < @, 


and (67) 
lint [Re (ey SS ae es 
Define 
Ww Ww (68) 
fn’ (w) = In(w) log ln(w) n=1,2,::- 
Thus, 
Nn(w) =a fn’ (w) ais fn(w), n= 1, 2, es (69) 


Again, just as in the ease of 7,(w), both {f,(w), n = 1} and {¢f,’(w), 
n = 1} are semi-martingales with respect to Py , and 


Eo{fi(w)} S Hoffo(w)} S ---, 


(70) 
Egihi'(w)} S Hoth’(o)} S - 
Furthermore, from (53), 
i. Pele Pers then lim Eo{fn(w)} = o. (71) 


N>oO 


However, from (69) and (70), divergence of Ho{f,(w)} implies that of 
Lyf nn(w)}. Hence, according to (61), the converse of (71) holds. Then, 
again from (70) and the equivalence-or-orthogonality dichotomy of 
Po and P, ) 
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Py) = P, if and only if lim Eq{fn(w)} < o. C72) 


Thus, upon substitution of (45) into (68) and application of (48) 
for expectation calculation, an alternative necessary and _ sufficient 
condition for Py = P, is obtained as follows: 


lim flog | (Ro) Ri” | + tr [Ro (RY) — TI} < 0. (78) 


Now, under the condition (63), the above condition implies that 


lim tr [Ro” (Ri) — I] < &. (74) 


Nn >eo 


Then, upon combination of conditions (62) and (74), condition (67) 
immediately follows. 

The result of this section may be summarized as follows: 

(2) In general, 


Sa = {lim In(w) 2 a/(1 — a)}, 


where /,(w) is defined by (45). 
(77) If Po = P,, which is true if and only if 


lim tr [((Ro”) 7 Ri — 22 + Ro” (Ri) 7] < @, 
then lim /,(w) = f(w), a.e. (Po, Pi); thus by defining /,,(w) = lim I,(w), 


Sa = {l,(w) 2 a/(1 — a)}. 


(iii) if 
lim tr [(Ro”) 7 Ri” — I] < ~, 
lim tr (Ro (Ri) — I] < &, 
then 


Sa = {6(w) 2 log (1/8)(a/(1 — @))*} 
where 6(w) and @ are defined by (65) and (63) respectively. 


2.4 Solutions — II 


2.4.1 General and Special Solutions 


Let \1 2 A» 2 --- and y(t), w(t), --- be the eigenvalues and the 
corresponding orthonormal eigenfunctions associated with the covari- 
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ance function 7o(s,t).” Similarly, let u, = uw. = --- and g(t), g(t), «+> 
be such eigenvalues and eigenfunctions associated with r,(s,t). Then, 
according to the discussion in 2.1 (27), continuity on T of each y(t) 
implies that the integrals 


g(a) = f awd, §=1,2,-% (75) 
T 
exist a.e. (Py, P,), and are Gaussian random variables. In fact, it 


can be shown that the density functions fo and pf; of &(w), «++ , &n(w) 
corresponding to Py and P, are given byt 


Balm, s+ yan) = (2) 1 Qn” [F 
ce ee (76) 
exp -3 PDO DCP eal vo} » m=0,1, 
@=1 j=1 
where Q,°”, m = 0, 1, are n X n symmetric and _positive-definite 
matrices defined by 
(Qo az = Aaday 5 (Qi) i5 = p> MiUniUr; y (77) 
where 
wy = | oeltys®) at (78) 
Let ®, be a Borel field generated by a class of w sets of the form 
{E:(w) Spi,t= Lot 5 U5 (79) 


and let &,, be the minimal Borel field containing U @, . Obviously, 
n=) 


Qi CRC: CR CBr. (80) 


It can be shown that, for an arbitrary A € @,, there exists some A € &,, 
such that 


Po( AAA) = 0. (81)t 


Now define a random variable {n(w) by 


* More precise definitions of these eigenvalues and eigenfunctions are given 
in Appendix B. 

t See Appendix C, 

t See Appendix D. 
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Piléi(w), Gy £,(w)] 
Polk (w), caer £,(w)] 


* exp ‘3 > > [(Qo!”)* (82) 





i,(w) = 


ad | O°” (Q” as 





= (Q1)]:5€:(w) 7) ; 


where (76) is substituted for the second equality. Again, note that 
in(w) is nonnegative for all n and also the fact that , = 0 whenever 
po = O and vice versa since Q,,°”, m = 0, 1, are positive-definite. Thus, 
again the processes tie); n = 1} and {1/in(w), n = 1} are martingales 
with respect to Py and P; respectively. 

By following step-by-step the same procedure as the one in the 
preceding section,” the following results are obtained: 

(7) In general, 


Sa = flim l,(w) = a/(1 — a)}. (83) 


(27) If Po = P, which is true if and only if 
lim tr [CQs°")7 Q. — 2] a O.” (Oy) Pe oO, (84) 


then 
lim i,(w) = f(w), ae. (Po, Py); (85) 
thus by defining 
L,(«) = lim in(w), (86) 
Sa = {i,(@) = a/(1 — a)}. (87) 


(iii) If 
lim tr [(Q°”)7 2° — I] < @, 


88 
lim tr [Q0°” (Q,)7 -— 1] < @, a 


then there exists a constant 6,0 < 8 < ©, such that 


: * In effect, it amounts to replacing @, and 1,@), n = 1, 2, --- n, by @ and 
l,(w) respectively. 
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lim | Qo” (Or")* | = B; (89) 
and, from (85) and (82), it follows that 
Him DY (QI = QU Nakilw)ELe) <=, fa 
a.e. (Pa, Py); 
thus, by aca 6(w) as the above limit, 
= {6(w) 2 log (1/8)(a/(1 — e))*}. (91) 
2.4.2 Integral Expression for 6(w) 


For the purpose of physical application, it is desirable to express the 
random variable 6(w) as a simpler function of z;(w), in particular, with- 
out involving limit operation. Examination of the definition of 6(w), 
1.e., 


6(w) = lim 3 = (Qo) = (Qi) Yisé(w) E(w), (92) 


n>o t=] j7=1 


indicates that 6(w) might be expressible as a quadratic form in 2;(w), 
Le., 


ie) e(w)h(s,t)t1(w) ds dt 


if such a square-integrable function h(s,t), (st) € T X T, exists and 
can be determined uniquely. It is the purpose of this subsection to make 
the above statement more definite and precise. 

Define an n X n symmetric matrix H™ by 


H™ = CO = (O°). 
Then, 
QP? Fo,” = Oe 45 On” 
or, through (77), the equation for the z-jth element becomes 
> NG (A) 5(O a = (Qi 4 — d,6 ijy 2, J = 1. vee yn. 


In other words, every ith row of H™ satisfies the following system of 
equations:* 


ys jnhix\” = b;(t) J = 1, vee yn, 
k=1 


* Note that the solution is unique, since the matrix (a,;) is positive-definite. 
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where 
aj = DS Kuen; 
ir nS he accor (93)* 
bs(t) = (@iz/Ni) — 5; 
or its standard form 
hi” = Voegha® +0,(4), g=al,-s+,n, (94) 
k=1 
where 
Cy = Oi5 — aij. 
Now, for each z = 1, 2, --- , consider the following infinite system of 
equations: 
hi; = PB Cirlin + b;(2), 7] => 1, 2; BRE? (95) 


According to the theory of infinite systems of equations,? if (95) has a 
solution (hi , he, ---) foreach? = 1,2, --- , such that 


> 3 hie < %, (96) 


then (hit, hie, +--+) is unique and 


hij = lim his”, J = 1, 2, aan: (97) 
for each 7 = 1, 2, ---, where (ha, «++, hn”), 7 = 1, «++, 7, is 


the solution of (94); provided that (95) satisfies the following condi- 
tions: for each z = 1, 2, -:-, 


2 lea | <1, (98) 
j=1 
and there exists a constant K; > 0, independent of j, such that 
ods K(1- Slewl) Fa LA. — (99) 


On the other hand, if (hi, hig, ---) is a solution of (95) for each 
7= 1,2, ---, satisfying (96), then the following integral equation 


j : Meni dhe=neD SAG). 00) 


* Note that (Qi™);; = ai; 37,7 = 1, 
t See Kantorovich and ais pp. 90-33. 
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has a square-integrable solution h(s,t), 


[J h'(s,t) ds dt < «, (101) 
TT 
such that 
(st) = 2) do hii(s)¥i(t), in the mean.* (102) 
w=1 7=1 


Conversely, if h(s,t) is a square- -integrable solution of (100), then (95) 
has a unique solution (hi, , hig, ---) for each 7 = 1, 2, --- , satisfying 


hi? < «©, such that 


ae 


Me 


ll 
- 


i=l 7 


his = | | vls\Als,0¥sO as det (103) 
TT 
Now, extend the definition of h;;‘”, 7 = 1, 2, --- , n,t by adding 
big” = 0; 4gantint+-. (104) 


Then, (90) and (92) can be rewritten as 


n>o i=] 


lim > ya” E(w)E(w@) < 0, ae. (Po, Pr), (105) 


and 


A(w) = Tim Dy 2) his &:()Ej(e). (106) 
According to the theory of coordinate and projective limits in sequence 
spaces, § (97) and (105) imply that 


(SSS TIMOR ae(PrsPa:. - Gos 


i=1 g=1 


since 


D Ve (w)Ew) <@, a0. (Po,P:). (108) 


7=1 j=1 


On the other hand, from (102) and square-integrability, a.e. (Po, P:), 


* See Appendix E.1. 

{t See Appendix F.2. 

t Namely, (hi, +--+ , hin) is the solution of (94) for eachz = 1, 

§See Cooke,!® pp. 282-289; in particular, Theorem (10.3, IT), "etieaded to the 
case of double’ sequences. 
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of {r:(w), t € T}, 
[ff ssloyn(op)an(w) ds at = LS huski(w) Ce), 
i=l j= 


a.e. (Po, P:). 


(109) 


Hence, 


io [[ a@ropee) deg Hee PY. 410) 


2.4.38 Discussion and Summary 


Recall that, in order to specify the set S. € @®r for given a as (91), 
it is sufficient to assume (88), which assures existence of 6(w) and ~ 
defined by (89) and (92) respectively. Moreover, in order to express 
6(w) as (110), it requires the additional assumptions that (7) the 
integral equation (100) have a square-integrable solution and (72) 
the conditions (98) and (99) be satisfied. 

It can be shown, however, under the assumptions (2) and the fol- 
lowing: 


ay <1, a7=1,2,---, (111) 
the conditions (77) and (88) can be replaced by the following: 


ai > >; | Qs; I, = i a. pear ee (112)* 
j=1 


and that there exists a constant K > 0, independent of 7,7 = 1, 2’ 
+--+, such that 


\(ais/As) = 8:5| S K(aus — D1" Jan), (113) 


where a;; is defined by (93).t It is quite possible that, once the condi- 
tion (2) is assumed, the conditions (111), (112) and (113) may be 
superfluous. That is to say, in some special cases, if the integral equa- 
tion (100) admits a square-integrable solution h(s,t) it may be pos- 
sible to prove directly that 


n>o t=1 


(sgt) = lim D> Yo has Val ssl) (114) 


in the mean, which immediately implies (97) and (105), thus establish- 


* The prime on the summation sign symbolizes omission of the term j = 7. 
t See Appendix D. 
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ing (107) and leading to (110). However, in the general case, establish- 
ment of (114) does not seem possible, nor does finding a sufficient 
condition for (114), without making the resultant condition excessively 


implicit and complex. 


2.5 Summary 


If 
| ro(s,u)h(u,v)ri(v,t) du dv = ri(s,t) — ro(s,t) 
TVT 
has a solution h(s,t), 


[f h’(s,t) ds dt < , 
TT 


then the set S, € ®r, given a (0 < a < 1), can be specified as 


Sa = wee t.(w)h(s,t)a,(w) ds dt = los 3 ¢ “ ae (115) 





where 
B = lim | Qo” Oey ls 
and 
(Qe) éi = ida; , (Qi) ij = Aij; gH l,i, 
and 
aii = a MEURiUK; 5 Lp A Tre, 
k= 
Us = [ oilt)ys(t) dt; 
where Ni = Ae = mt eg W(t), yo(t), ne and My es He = Sy gi(t), 
g(t), ---, are the eigenvalues and the corresponding orthonormal 


eigenfunctions associated with the given covariance functions 79(s,t) 
and r,(s,t), which are positive-definite and continuous on 7 X T; 
provided that 

(1) ai¢<1,0=1,2,-°-, 


(2) ax > Do’ | ais |, -= 1, 2)-+<, 
j=l 
(3) the following is bounded uniformly in 7,7 = 1, 2, ---: 
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Qi; | 

OE 

go is Fee 

1 — dD) | b% — ai | 
k=1 
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APPENDIX A 


Theorem on Optimality 


Let Po and P, be probability measures defined on a Borel field @ 
of subsets of an abstract space 2. Through the use of Lebesgue decom- 
position theorem and Radon-Nikodym theorem:* for a nonempty set 
H € 6 with P,(H) = 0, there exists a nonnegative function f(w) in- 
tegrable over 2 with respect to Po such that 


P,(A) = i. f(w)dPy + P(A NH) (116) 


for an arbitrary A € @.T 
Theorem: For an arbitrary constant k > 0, define a set S € ® by 
S = {f(w) 2 UH. (117) 
Then, 
kPo(S) + Pi(S°) — kPo(A) — Pi(A°) S$ 0 (118) 


for an arbitrary set A € ® where S° and A‘ are the complements of S and 
A with respect to Q. 


Proof: 
Put p = kP)(S) + Pi(S°) — kPo(A) — P(A‘). By adding and 
subtracting kPo(S M A) and Pi(S° f A’), 


* See Loéve,!° pp. 180-132. 
{ This paragraph closely parallels Grenander,! pp. 209-210. 
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p = k{Po(S) — Po(S NM A)] + Pi(S*) — Py(S° a) A*) 
— k[Po(A) — Po(S 1 A)] — Pi(A°) + Pris’ 1 A’) 


(119) 
= kP,(S NA°) — PiVS NA°) + PiCS° NA) 
— kP,(S° NA). 
From (116) and (117), with k > 0 and Po(H) = 0, 
P,GS NA°) — PGS’ NA) 
- [ngel(odaPo ~ Joon, fooaPr (120) 


+ PS NA° NA) — PS’ NANA) 
kPo(S N A‘) kPo(S° N A), 


IV 


since 
PS’ NANA) 
=P,({f(o) <i} NA NANA) S Pi(A’NA) =0. 
By substituting (120) into (119), 
p <0, 
which proves (118). (Q.E.D.) 


Corollary 1. Suppose Py = Pi, and let k = [a/(1 — a)],0 <a < 1. 
Then, a set Sq defined by 


Sa = {f(w) 2 a/(1 — a@)} (121) 
has the property expressed by (118), 7.e., 
aPo( Sa) + (1 — @)Pi(Sa°) S aPo(A) + (1 — a)Pi(A*) (122) 
for an arbitrary A € B. 


Proof: 
Note that Po = P, implies P;}(H) = 0. Hence, in (118), 


kPo(S) + Pi(S°) = [e/(1 — e)JPo(Sa UH) + Pi(S. U H) 
= [o/(1 a a)|Po( Sa) =e Fal Sa) 


Thus, substitution of the above into (118) and multiplication by 1 — a 
proves (122). 


Corollary 2. Take Q to be R, , an n-dimensional Euclidean space, and & 
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to be Borel field of right semt-closed, semi-infinite intervals in R, , denoted 
by Fn . Let Pm(X1, +++ ,2n),m = 0, land (11, --+,2n) € Ra, be Batre 
density functions corresponding to Pm,m = 0, 1; 2.e., 


Pin{ Xi <S pi,t = I, -++, d} 


pl Pn (123) 
-[ vf dx, +++ Atn Dm(%1, +++, In). 


Suppose pi(t1,°+* ,2n) = O whenever p(az,°+°+,%n) = 0. Then 
San defined by 





San = (Pile, «--124) te) ah 28 
Po(%1,°°*;%n) Ll—a 


has the property expressed by (122).* 


Proof: 
Note that Pp) = P,, thus Pi(H) = 0. Then, from (116), 


Flan <5 ty) = BAPE ated a.e. (Po, P1) 


Hence, apply Corollary 1. 
APPENDIX B 


Preliminaries on Integral Operatorst 


Let L be an integral operator with a real, symmetric, continuous and 
positive-definite kernel r(s,t) defined on the rectangle 7 xX T where 
T is the closed interval [0,1]. That is, 


1g) = f rs, Of(6) as, (124) 


where f(¢) is an arbitrary real-valued function in the space of square- 
integrable functions on 7’, which is symbolically denoted by £2(0,1), 
or simply by £2. 

Then, according to the theory of linear operators, all the eigenvalues 
of L are positive, of finite multiplicity, and finite or denumerably in- 
finite in number. Thus, counting each eigenvalue as many times as its 
multiplicity, we can construct an ordered sequence of eigenvalues, 

* This replaces the Neyman-Pearson theorem in the classical theory of testing 
simple hypotheses when the criterion changes from the Neyman-Pearson’s to 


the minimum error probability. See Cramér,!8 pp. 529-530. 
t See Riesz-Nagy,!” pp. 227-246. 
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My 2 de Beet, (125) 


and the corresponding sequence of orthonormal eigenfunctions (using 
the Gram-Schmidt orthonormalization process if necessary), 


W(t), Y2(t), “ete (126) 
Then, according to Mercer’s theorem, 
r(st) = 2, dbi(s)¥i(0), (127) 


where the series converges uniformly on 7’. Consequently, y;(t) is 
continuous on 7 for all 7, and 


di = f rit) dt< ©, (128) 
=] T 


4 


namely, the sum of all eigenvalues is finite. 
Furthermore, because of the positive definiteness of the kernel r(s,t), 
the set of the eigenfunctions {y;(¢)} forms an orthonormal basis of £». 


Let {y.(t)} be another orthonormal basis of £. Then, 
y(t) = ny uije:(t), in the mean, (129) 
where 
wis = | ous at, (130) 


which satisfies the following orthogonality conditions: 


> Uinje = [ ee dt = 6:;, 

: (131) 
na Ukilky = i Wilt)y;(t) dt = 6; . 

k=1 T 


APPENDIX C 


Density Functions of &:(w),7 = 1, ++: ,n 


It has been established in Section 2.4.1 that the random variables 
defined by (75), ie., 


2G) = L eloVdiidh. £12594, (132) 
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are Gaussian variables with respect to Pp and P,, where 
Peay OR a 0A... nd [ #2) di < 0, 
a.e. (Po, Pi). 
C.1 With respect to Po 
Through repeated use of Fubini’s theorem, 
BolE(a)} = f Holedla)} vil) dt= 0, £=1,2,--, (138) 
and 
Balei(e) &()} = ff [ Boles(o)2u(w) Wa(s)vs( ds at 


ies ro(s,t)Wi(s) w;(t) ds dt (134) 


I 


Aidiz 5 4,J = 1, 2, a) 
where Mercer’s theorem is used for the third equality. Then, since 
E(w), 7 = 1,-++, 7, are Gaussian variables. (133) and (134) immedi- 
ately give (76) and (77) with m = 0. 
C.2 With respect to Py 

By substituting (129) into (132), 


&;(w) = pz Uning(w), a.e. (Po ; P,) (135) 

where 
no) = [ wew)ei(t) dt, 2 = 1,2, ++," (136) 

T 
which exist a.e. (Po ,P1), and Gaussian variables just as &;(w), 7 = 
1, 2, --- , are. Then, the results in C.1 imply that 

FEy{ni(w) 1;(w)} = pidi; 5 1,J = 1, 2, a veh ag (137) 

Define 
&; (w) = Du was); J = 1, vr yn, (138 ) 


* Note 7:(w) here must not be confused with the one in Section 2.3.2. 
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and let F,°” be the distribution function of £°(w),-->, &°(w), 
and let fi (71, °°, tm), -© <1; < ©,7 = 1, °--, Nn, be its char- 
acteristic function with respect to P;, i.e., 


A (m+, te) = {exp E pD ie) |}, (139) 


Then, according to Levy’s continuity theorem,” lim F,™ exists if and 


M>o 


only if lim fi” (71, -++ , tn) exists for every r;, —© <7; < ~, and 
continuous at 7; = 0,7 = 1, ---, n; and, furthermore, when lim Ff, = 
F, exists, its characteristic function f\(71, --+ , tn) isequal to lim fa 
(m1, *°+, tT) for all r;, —7 <7; < ~,7 = 1, --:, nm. Hence, 
it suffices to obtain lim f,°” (71, -+- , Tn), namely, the limit of (139) 
as m — o, and to assure its continuity at the origin. 
By substituting (138) into (139), 
Gy ves, ™) = Ey fexp E 2d T; > ume) }} 
= fi; fexp bP in: (w) yu aa! 
m 1 
= II exp 5 Mk = TiUkj 
k=1 3 
= exp =~ oa TiTjLKUKiUk 
= k=1 i=l j=l 
Note that 
2 | xteniteny | = > ee | uxitte; | S DS ME < © ; (138) 
k=1 k=1 k=1 
since 


Unitles | = | | gn (t) y(t) a / gx (t)y;(t) a 


[oto al [oro a [v2 a] 


=]. 
* See Cramér,!® p. 102. 


IIA 
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Hence, 


n 


Me 


n n fo a) 
y TiTGFURURUG = os oe TiT; dX MEUKilyy » 


k=1 i=1 j=l i=l j=l 


] 
ba 


Then, putting 
(Qa ) «5 = De man 5 ae) = 1,+++ Nn, 
continuity of exponential functions implies 


lim ru (n, -*+,7,) = exp |-32 ~ (Qi aren], 


m-> oO 


? 


which is obviously continuous at 7; = 0,7 = 1, --- , n. Note that the 
right-hand side above is the characteristic function of the Gaussian 
distribution function with the density function (76) and (77) with 
m= 1. 


APPENDIX D 


Po — Equivalence between ®r and 8,, 


It is to be proved that, for an arbitrary set A € @,, there exists a 
nonempty set A € @&, such that Py(AAA) = 0. Note, however, that 
the above statement is equivalent to the following: 


Let ¥r C Br be a class of all sets A € By such that A € 7 implies 
existence of a nonempty set A € 6, with Po(AAA) = 0. Then, 7 = 
Br.” 


The second statement will be proved. 
D.1 For every t € T, 


tia) = Dy w)be(), — a.0.(Po. (139) 


Proof: 

According to the discussion in Section 2.1, (iz), &(w), k = 1,2, ---, 
are equal, a.e. (Po), to the Riemann integrals in quadratic mean cri- 
terion of 2:(w)¥x(¢) on T. Hence, from the proper orthogonal decomposi- 


*Tt must be proved first that such an F7 is not empty. This will be done in 
Section D.2. 
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tion theorem,* the series of (139) converges in quadratic mean with 
respect to Po to 2;(w) uniformly on T. Furthermore, &(w),k = 1,2, °°: , 
are mutually independent Gaussian variables with means zero and vari- 
ances A; , k = 1,2, ---+ , with respect to Py .} Hence, the series converges, 
a.e. (Po), to a limit for every ¢ € T since the series of its variances con- 
verges for every t € T,i.e., 


Y Blew) WW) = Lo rwalOvelt) = rol) < @, 


from Mercer’s theorem. Yet, since both the convergence in quadratic 
mean and the convergence almost everywhere imply the convergence 
in probability measure, this limit must be equal, a.e. (Po), to x:(w) 
for every ¢t € T. (Q.E.D.) 


D.2 Let Ar € @r be defined by 
Ge) S82 ea. (140) 
Then there exists a nonempty set A, € &,, such that 
Po(ArdAr) = 0. 
Proof: 
Consider a set defined by 


Ar = >> Ex(w)W(ti) Spi,t=1,°--, nf. (141) 
Clearly, Ar € &,. Define I; € @r by 
ry, = {ra(e) = awn}, te 7. (142) 


Note that (139) implies 
Po(T:) = 1, t € a. (148) 
Then it is self-evident that, for¢; € 7,7 = 1, ---,n, 


Arf (A rn) + Arf (u ri’); 
i=1 i=1 


Ar = Ap a (A rs) ae Ar a (0 Be), 


where I';,° is the complement of T,, . Note that, from (142), 


* See Loéve,!° pp. 478-479. 
t See Appendix C.1. 


I 


Ar 
(144) 


| 
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arn( A ra) = ae n(f ria) (145) 
i=] 


i=1 a 


and, from (143), 


P E N (u ru‘) | = 0 = Po E N (0 ri‘). (146) 


Hence, upon combination of 144, 145, and (146), 


P)(ArAAr) = 0. (Q.E.D.) 
D.3 F = Qr ° 
Proof: 

First, it is easily seen that the class §,7 is a field. Moreover, it will now 
be shown that §,7 is a Borel field. Let A; € ¥r,7 = 1, 2, ---. Then, 
from the definition of 7 , there exists A; € 6, such that 

P(A;AA:) = 0, t= 1,2,--°. (147) 
Define two sequences of null sets MW; and V;,7 = 1, 2, --- , by 

M;= A; — hi, Ni= Ai — A. (148) 
Then, 


Ay NSC ASG: AG UM;, @=1,2,--- 


Hence, 


ics 
>») 
| 

| C8 
= 
N 

ics 
i 
nN 

4 ‘ 

ics 
b> 
=. 
Cc 

——_~ 
ics 
S 

NILE 


eR 
° 


7 


which implies 


Thus, 


namely, 


2804 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


Furthermore, since 


NA; = (0 A) 
t=1 t=1 
and also §7 is a field, 
MA; € Fr. 
i=l 


Hence, S7 is a Borel field. 
Secondly, note that 57 contains the generating class of @7 as shown 
by (140) and (35). Hence, 


Sr Br. 
Yet, from the definition of ¥7 , 

Fr C Br. 
Therefore, 


Fr = Br. (Q.E.D.) 
APPENDIX E 


Equivalence between Two Equations 
E.1 Prelaminary 


Through Mercer’s theorem, 


ro(sst) = Do rabals)Wa(O), (149) 
uniformly. 
ri(st) = Do mver(sdon(2), (150) 
Then, 
[ a ri(s,t)Wils)y,;(t) ds dt = > MeURiUng = Aaj. 
Hence, 


(3) = 3 > aijWi(s)W;(t), in the mean. Cla) 


* This is a trivial extension of well-known results in the case of functions of one 
variable. A special case of (151) is found in Courant and Hilbert,!9 pp. 73-74. 
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K.2 Equivalence between Two Equations 


Equation (95) can be rewritten as 
= Nihides = Giz — Vida; , pe dl te, (152) 
where 7 = 1, 2, --- . Repeating (100), 
[ [ ro(s,u)h(u,v)ri(v,t) du dv = ri(s,t) — ro(s,t). (153) 


(a) If (ha, hz, +--+) is a solution of (152) for each 7 = 1, 2, --:, 
with 
pay 


t=1 j 


itt 


Pe Sea (154) 


then a square-integrable function vee with 


h(s,t) = >> hiwils)v;(t), in the mean, (155) 
i=l j=l 
satisfies (153). 
Proof: 
The left-hand side of (153) is clearly square-integrable. Hence, it 
has the following expansion: 


i / ro(s,u) h(u,v)ri(v,t) dudv = 5 > er Nihindijils)y; (2), 
T/T i=1 j=1 k=1 (156) 


in the mean, 


since, through substitution of (149), (150), and (155), 


[ i: i i ro(s,u)h(uv)ri(v,t) du av| vi(s)p;(t) ds dt 
(157) 


Mes 


Aline; 3 1,J = 1, 7 a 


x 
I 
_ 


Yet, by virtue of (hi,hi,--:) being a solution of (152) for each 


7 = 1,2, ---, the right-hand side of (156) becomes 
Dy dy Dd Ateintnsbils)¥i(D = 2D (az — dids)¥iCs)¥s( (158) 
i=1 j= = I= 


the right-hand side of which in turn becomes, from (149) and (150), 
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r1(s,t) = ro(s,t) 


Ms 
Ms 


(ai; =, Ali) Vils)Y;(), in the mean. (159) 


ll 
ba 
] 
un 


1 7 


Thus, upon combination of (156), (158), and (159),” 


[ rolsudh(uyo)ns (os) du dv = 7(s,t) — ro(s,t). (160) 


(Q.E.D.) 


(b) If h(s,t) is a square-integrable solution of (153), then 
(ha , hig, +++) satisfies (152) for each 7 = 1, 2, --- , where 


his = [ [ n(sOvils¥os(O ds at. (161) 
“oT 4T 
Proof: 
Since h(s,é) is square-integrable, it has the expansion of (155) where 
hij 3447 = 1, 2, --- , are defined by (161); thus (157) is established. 


Meanwhile, from (149) and (150), 


[ [eo — ro(s,t)|Wils)¥;(t) = aiz — Adis 


(162) 
ig =1,2,---. 

Then, combination of (160), (157) and (162) establishes 
>. Nhine; = aij — didi; . (Q.E.D.) 
k=1 

APPENDIX F 

Alternative Conditions 

Assume 

Gk-<— A, 4=1,2,-:-, (163) 


and the integral equation (100) 


| / PeMiinn@ONtoN— HED =A “des 


has a square-integrable solution.t Then, the conditions that 


* Note that, if a sequence of functions converges in the mean to two limits, the 
limits are equal almost everywhere. Furthermore, if the limits are continuous, 
they are equal everywhere. Note also that continuity of the left-hand side of 
(156) can easily be seen through the use of the Schwartz inequality. 


t Recall from Appendix E that this implies > ini > j-1 hi? < ©. 
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ai > DY lai; |, (165)* 
j=l 
and there exists a constant K > 0, independent of 7,7 = 1, 2, --- , such 
that 
| (a:;/Xz) a éy; | <s K (, — = | jk ), (166) 


imply the conditions (98), (99) and (88); namely, for each z = 1, 2, 


Pea <1 (167) 
7=1 
and there exists a constant K; > 0 such that 


(6) $ Ki (1- Dlenl), ee OP ae (168) 


k=l 
and finally 
lim tr [(Qo”) Qi” — I] < », 
lim tr [Qo(Q:”)7 — I] < &. (169) 
Proof: 
First, note that 
ai >0, t*=1,2,::-, (170) 


and 


ae eee (171) 


i=1 t=1 


For, from (93) and the fact that uw, > 0,k = 1,2,---, 


and, from (131), 


ioe] 
SS aii = 
i=l 


* The prime symbolizes omission of the term 7 = 7 in the summation. 
{ For justification of interchange of order of summation, see Apostol,?° pp. 
374-375. 


Mes 


ie) fe) co] 2] 
DS pete?” = Do oe DD me? = DD nt 
k=1 a=1 k=1 


i=1 k=1 
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Second, through (93) and (94) with (163), (165) and (166), 


foo] 


1 dele] =1— 2785 — as 


1 |i ~ ae] — Do" | ass | 


— 2 | ai; | > 0, 
j=l 


and 
b; } a r; Oi 
oe ae R72) ij =1,2,°°, 
1-— Dien) a5 — D' lay | 
k==1 k=l 
which prove (167) and (168). 
Last, note from the definition of hi; 7,7 = 1, 2, --- ,* 
tr (Gy) 0.” _ I] = > D3 hij” aiz = 2 yD his” aij ’ 
wl j= 1s j= 


Ms 


er [Qo (Qi) > = I) = hig r< Oy = »» hi r. 


I 
es 
ll 
” 


i=l j 


Yet, according to the theory of infinite systems of equations, for each 
= i 2, $0 8 39 


[his |S Ki, j=1,2,°°+.F 
By putting K; = K,i = 1,2,---, 
| his | < K; ij = 1,2, °°: 


Then, 
Se KEY lay! < 0, 
> [mgs | SK 3 di < @, 
since 
~* Recall: 


ae = T= (Qo) = (1), 
HO = (QM — QOPI sbi = Ay am 


ij=n lin + 2,--- 


t+ See Kantorovich and Krylov,!® pp. 26-27. 
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fo] 


2, 2, !e8l = ¥ (aw + Davi) <2 av = 2 Dw < oO, 


Hence, from (97), 


lim | tr [(Qo”)~* Q@” — 7) | 





ll 
= it 
g 8 























— ye higQij 
t=1 j=l 
[a ) 3 co 8 4 
2 
s (Sat) (S Sat) 
t=1 j=1 i=l j=l 
<o%, 
foo} 
lim | tr [Qo (Qi) — Z] | = lim | > hes 
N-»00 n>o | t=1 
= S. heids 
i=1 
00 4 ee) 4 
= (> het) & a?) 
i=l i=1 
< @, 
(Q.E.D.) 
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On the Spectral Properties of Single- 
Sideband Angle-Modulated Signals 


By R. D. BARNARD 
(Manuscript received May 11, 1964) 


The representation of single-sideband angle-modulated carriers as origi- 
nally given by Bedrosian is generalized through the functional and spectral 
notions of distribution theory. In this treatment the class of related modulat- 
ing signals is extended to rather general types of distributions, and spectral 
criterta and iteration algorithms are established by which such modulating 
signals can be recovered from bandlimited components of the modulated 
carriers. 


I. INTRODUCTION 


Among the more recent signal transmission techniques for conserving 
spectral bandwidth is single-sideband angle modulation, first proposed 
and investigated by Bedrosian.! In this scheme a carrier wave is simul- 
taneously angle modulated by an appropriate baseband (bandlimited) 
signal and amplitude modulated (multiplied) by the negative exponential 
of the Hilbert transform of the baseband signal, the combined modulation 
process resulting in an RF spectrum which vanishes identically on the 
low-frequency side of the carrier frequency and carrier axis crossings 
which coincide exactly with those of a conventional angle-modulated 
carrier modulated by the same baseband signal. The single-sideband and 
axis-crossing properties, although suggesting means with which to ob- 
tain ideal bandwidth reduction and compatible detection, are only par- 
tially applicable to physical systems.* In general, the RF spectra under 
the combined and conventional modulation schemes are of infinite ex- 
tent, and the nonvanishing portion of the spectrum under the former can 
have, according to any one of several common definitions, a larger effec- 
tive bandwidth than that under the latter; consequently, single-sideband 
angle modulation does not necessarily lead to bandwidth reduction, and 


* Detection compatibility is suggested by the fact that the output of an ideal 
limiter depends only on the axis crossings of the input. 
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the axis-crossing patterns of filtered versions of single-sideband and con- 
ventional angle-modulated carriers can differ appreciably. Nevertheless, 
Bedrosian has shown that the combined form of modulation, as pre- 
scribed, offers the possibility of both a reduction in the effective band- 
width over limited ranges of the angle-modulation index and detection 
compatibility. It is therefore of practical and theoretical interest to es- 
tablish criteria relating either directly or indirectly to the spectral prop- 
erties of such carrier waves. In the present paper we specify rather general 
signal conditions under which the Bedrosian scheme and the associated 
single-sideband property obtain, and determine spectral conditions un- 
der which knowledge of the RI spectrum over a frequency interval 
slightly wider than half the signal bandwidth provides enough informa- 
tion to recover the baseband signal up to an additive constant.* Signal 
recovery in this second category is effected by an iterative computation 
that cannot be carried out exactly in real time; however, the possibility 
of pure mathematical recovery based on a finite portion of the spectrum 
constitutes an important spectral property, indicating that the RF spec- 
trum, although infinite in extent, can be viewed theoretically as having 
an effective bandwidth equal to half the signal bandwidth. t These quali- 
tative results are now restated somewhat more explicitly. 

In precise terms, single-sideband angle-modulated carriers are gener- 
ally assumed to have the form 


y(t) = exp [—4(t)] cos [2xft + x(t)] 


where x, £, and y, represent respectively a specified angle-modulating 
signal, its Hilbert transform, and the modulated carrier, the first two 
functions being periodic or square-integrable, bounded, and bandlimited 
to some frequency interval [—fo , fo]. Modulated under these conditions, 
y- exhibits the two previously mentioned properties with respect to band- 
width and detection; viz., the corresponding amplitude spectrum (Four- 
ier transform) vanishes over (—f,,f-), and the axis crossings as well as 
the effects that they produce at the output of an ideal limiter coincide 
exactly with those of the usual angle-modulated carrier 


* Contrary to established usage, the term “‘bandwidth”’ refers here and through- 
out to the total frequency spread of the spectrum of the baseband signal over 
both positive and negative frequencies (cf. Section 2.2). 

+ Other problems and criteria pertaining to the recoverability of signals subject 
to nonlinear and bandlimiting operations have received considerable attention 
recently.?3 Beurling’s theorem, directly applicable to instantaneous compandors, 
is perhaps the principal result along these lines.4 ,In unpublished work, H. O. Pollak 
shows by means of Fredholm equation methods that under special conditions the 
baseband signal of a conventional FM carrier can be recovered mathematically 
from knowledge of the RF spectrum over an interval of twice the signal bandwidth. 
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41 = cos [2nf.t + x(t)].* 


To deal with more general modulating signals, i.e., signals which are 
neither periodic nor square-integrable yet to which the spectral concepts 
of Fourier transforms and the results above still apply, requires the 
theory of temperate distributions (generalized functions).5.*7 In this 
paper we treat both x and y, as special types of distributions and in- 
vestigate the feasibility of recovering the former from bandlimited 
components of the latter. Specifically, we: (z) generalize the definitions, 
concepts, and methods of classical Hilbert transform theory to incor- 
porate arbitrary distributions (cf. Section II, Definition 2 and Theorem 
3);T (72) extend the class of modulating signals to include all bounded, 
bandlimited distributions with bounded generalized Hilbert transforms 
(ef. Section III, Theorem 4); and (272) establish through a standard 
fixed-point theorem related subclasses for which the spectrum of y, 
over any open interval containing [f, , f- + fo] furnishes sufficient infor- 
mation for reconstructing derivative 2’(t) by iteration (cf. Section IV, 
Theorems 7-9).f{ It is intended also that this development illustrate the 
distribution-theoretic approach to be generally employed in connection 
with other modulation schemes. 


II, PRELIMINARIES 


As noted above, characterizing the amplitude spectra and spectral 
properties of the signals considered in this paper requires the theory of 
temperate distributions.* We discuss here four aspects of this theory: 
notation and terminology, bandlimited distributions, convolution, and 
generalized Hilbert transforms. 


2.1 Temperate Distributions — Notation and Terminology 


Let J denote a specified, open interval on the real line with /,, , [4.,, , 
and J_,, signifying respectively the intervals (—~»,«), (0,©), and 
(— 0,0); J, the closure of J; C*(/), the space of scalar functions of which 
the derivatives up to and including order & are continuous on J; and 
Ca, the space of “rapidly decaying”’ functions, viz., the linear vector 
space 


* In the first case the nonvanishing portion of the spectrum of y, is generally 
so smeared out as to have an effective bandwidth greater than that of y; . 

t For detailed examples of Hilbert transform applications in modulation theory, 
the reader is referred to the expositions of Rowe,’ Bennett,® and Dugundji.!° 

¢t Landau,? Miranker,? Sandberg,’ and BeneS!! have recently made extensive 
use of fixed-point theorems in a variety of system-theoretic problems relating to 
recovery and stability. 
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Ca = {ele € C*(1,), te (t) > O( |t| > ©) Wie = Of. (1) 


Also, a topology in C3 is introduced by means of the metric 


ee ulgr — g2,J,k — 7) 
paler ,g2) = py a ara (L+u@i—¢2,3h—-7))  (2)* 


¢1,¢2 € Ca 





where 
u(¢,j,k) = sup | to (t) |. 


For convenience the convergence of a sequence {¢,} relative to this metric 
is expressed as gy, > ¢y(yn,¢ € Ca). Since the series in (2) converges 
uniformly over such sequences, it follows that 


gn > 0 sup | t’y, (t) | —> 0 Wik. 
tel, n> 


As generally defined, temperate distributions are merely the elements 
of the conjugate space of Ca, i.e., the space of linear, continuous func- 
tionals on Cz." In the treatment below we represent this space by D 
and the corresponding elements by 2(-). Although mathematically dis- 
tinct, a distribution x(-) and an ordinary function x(-) for which 


[ ; Heda ae) Weide; (3) 


are regarded as characterizing one another, either form being essentially 
determined by the other. + To extend this notion, we associate every ele- 
ment « € D with a “generalized function” x(-) (cf. Ref. 6), viz., the 
totality of sequences {x,(-) } in Ca such that 


lim is an(t)e(t)dt = xe) Vo € Cu. (4)t 


As distributions and generalized functions are in one to one correspond- 
ence, it is common to employ all related terms and symbols interchange- 
ably. Also, the ordinary and generalized functions relating to (3) are 
considered to be equivalent in that both define the same element of D. 

In connection with the equality of distributions, let N[y] signify the 
null set of g € Ca, viz., 

* Space Cz constitutes a complete linear topological space in pa(-,-) (cf. Ref. 
12, p. : 
{ “Essentially” is used here to indicate that 2x(-) determines x(-) almost 
everywhere on /,, (cf. Ref. 5, pp. 1645-1646). 


t Sequences satisfying this condition can be shown to exist for an arbitrary 
distribution (cf. Ref. 6, p. 183). 
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Nig] = {t|¢ € Ul.; ¢(t) = OWE € L,I, 


and S[g], the support of ¢, viz., 
Sle] = 1. — Niel. 
Similarly, let Np[z] signify the null set of x € D, viz., 
Na] = {t|¢ € Ul; xv) = 0 We 9 Sly] GLa, ¢ € Ch, 


and S>[z], the support of x, viz., 
Sp[t] = I, — Nolz]. 


Accordingly, if for an interval J € J, and two elements x and y of D, 
I CNpI[x — yj], we say x(-) = y{-) and a(-) = y(-) on J. This defini- 
tion also allows one to equate generalized and ordinary functions on 
arbitrary intervals; that is, if 


np) = i v(t) p(t) dt (5) 


for some v and all g € Cz such that Sly] C I, then J C Np[x — v] and 
x =vonl. 

Among the standard operations associated with distributions, five re- 
quire special notation: 

(7) Products. With respect to any two distributions x and y of which 
at least one, say y, characterizes an ordinary function y(-) such that 
y(-)e € Ca Vo € Ca, let xy(-) (and yx(-)) denote the product of x and 
y given by 


xy(e) = yx(e) = xyp) We € Ca, (6) 


and let x(-)y(-) (and y(-)a(-)) denote the related generalized func- 
tion.° 
(22) Derivatives. For any x € D, let p"x(-) denote the nth order deriva- 
tive of x given by 
p’x(y) = (—1)"#(e") We € Cu, (7) 


and p”x(-), or (d”/d(-)”)x(-), the related generalized function.° 
(at) Antidertvatives. For any x € D, let 


[ %«) 
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denote any nth order antiderivative of x satisfying 


p’ | 2(e) = ale) We € Ca, 


and 


fe), 


the related generalized function. All nth order antiderivatives of a par- 
ticular element x € D can differ by only additive polynomials of degree 
n — 1 (ef. Ref. 7, p. 8). 

(iv) Limits. Distribution limits of the form 


lim ay) =2x~)€D Wee Ca 


are represented in terms of generalized functions by 
lim™ a(-) = x(-). 
r 
(v) Fourier Transforms. Foranyx € D, let &(-) denote the generalized 
Fourier transform of x, viz., the distribution given by 
iy) = xF-y) Wee Ca (8) 


where 
Pee=[ Qed i= SH, 


and let #(-), or F-x(-), denote the related generalized function (cf. 
Ref. 6, p. 188). For the right-hand functional in definition (8) to exist, 
it is required that F'-g € Ca, a condition which holds for all g € Ca. 
Rewriting this relation yields the more suggestive form 
Fay) = «(F-¢). (9) 
Similarly, 
F™-a(g) = «(F'-) 


where 
F =| g(t)e dé. 


One property pertaining to operators lim™, p”, and F is of paramount 
importance in applications of distribution theory: the last two commute 
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with the first.° The reader is referred to the previously mentioned litera- 
ture for a detailed discussion of these and other operations as well as the 
various terms outlined above. 


2.2 Bandlimited Distributions 


Let f and J denote respectively a point and a compact set on the real 
frequency line /,, . A distribution x for which Sp[F'-2] C J (ie, F-2 = 0 
on any J disjoint from /) is defined to be bandlimited to J, the space of 
such elements being designated as 


B(J) = {a|x € D, Sp[F-2] C J}. 
Defining, in addition, the space 
C, = {vlv € CP); Wea 9 (1+ #) W(t) > OC [t] > &)}, (10) 
we establish the following 
Lemma 1: If x € B( J), then x(-) € Cy. 


Proof: Construct a real, positive function ¢(f) € Ca satisfying the con- 
ditions 


Jt $6.2 
rn = {f fEl Iigked 


and set 
v(t) = F-axe(fye*™"). (11) 


We consider first representing /’-x(-) on J; by an integral. For this it is 
necessary to employ the well known result that on any finite interval 
an arbitrary distribution can be characterized by a multiple derivative 
of some ordinary, continuous function; more specifically, there exist 
both a function y € C(J2) and an integer N = 0 such that 


Peale) = (=1)" | vNe™ Naf (12) 


for all y € Ca for which Sly] € I» (cf. Ref. 7, pp. 11-12). Inasmuch as 
S[¢] G Le, expression (11) becomes 


a” 


Qrift 
ayt [e(ner' laf, 


v(t) = i: (—1) Vf) 


which in turn gives 
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MO =f Hon Flenne nea 


Fea((Qrif) (he) = ott’) (lt| > @) 


for all k; therefore, v € C”(/,,). In order to identify v further, observe 
that for any k 


I 


(l+0)%(t) 0 3 (|t|> &) 


for some integer 7; consequently, by (10), € C,. 
Regarding the relationship between v and x we note that since 


[. i (-1) yn 2a te Deora a 


I 


ie »(t)y(t) at 


uy" f [vn 2 tenew@ertae ar 


I. (“DVD 555 7 [en ir “older | df 
= F-x(F"-e) We € Ca 
and since S[(¢ — 1)6] N Sp[F'-x] is empty for all 6 € Ca, 
ay) = F-a(F +e) + Fea((¢ — 1)F-9) 


= F-a(tF-9) = [| ved. 
Hence, in accordance with (3) et seq., v(-) = x(-) with v € C,. 


2.3 Convolution 


A convolution operation sufficiently general for most applications in 
signal theory is given by 


Definition 1: For any two distributions x and y of which at least one, 
say y, is such that F-y(-) € C, we define a distribution x+y, termed the 
convolution of x and y, by the relation 


ney = yar = F*-[ (F-x) (F-y) J. (13) 


As to the consistency of this definition, observe that with F-y(-) € C,, 
gF-y(-) € Ca for all g € Ca; therefore, according to (6) et seq., both 
the product (F-x)(F-y) and corresponding convolution exist as dis- 


* Interchanging the order of integration in this relation is justified by means 
of the Tonelli-Hobson theorem (cf. Ref. 13, p. 3). 
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tributions, and their factors commute. The associative and distributive 
properties of this operation depend in general on the factors involved, 
the results in any given case being determined directly from (13). One 
unportant consequence of Definition 1 is stated as 


Theorem 1: For any two distributions x and y of which at least one, say y, 
is such that y(-) € C, 


Fe(cy) = (F-ax)+(P-y). 
Proof: From 
aty(t)) = 2h F-p(—-t)) = F-F-xg(—-t)) We ED, Wee Ca, 
(6), and (13) it follows that | 
F- (ay) @(f)) = ry(F +e) = x{y(t)F-¢) 
= F-F-xly(—t)F +9) = F-F-x( (F-F-y(t)|F-¢) 
[(F-F-x) (F-F-y) ](F +9) = FU[(F-F-x) (F-F-y) ] (&) 
= [(F-x)«*(F-y) ] (). 


We show at this point that Definition 1 relates to a more common but 
less general form of convolution (cf. Ref. 7, p. 31). 


I 


Theorem 2: If at least one of two distributions x and y, say y, has a finite 
support (1.e., Soly] GI C I,,), then xxy exists, and 


rey(e) = x(y(v(t + t))) We € Ca. 


Proof: Reversing the roles of t and f in Lemma 1 demonstrates that with 
Sp[y] finite, 1.e., with y time-limited to J, F-y(-) € C, ; hence, by Defini- 
tion 1, v*y exists. In addition, from (13), (9), and (6) there obtains 


way(o(t)) = P+ (aay) (Fe) = [ (Fx) (Fey) ] U9) 
F-«( (F-y) (Fg) ) = 2(F-[ (F-y) (Fe) 1) 


x f et 8 -y) (Pg) alt) 


where the subscripts indicate a function of ¢. As the integral of this last 
functional proves to be linear and continuous on C4, 1.e., as 


[. (F-y) deo" 8 (Fe) ddé = F-y((F*-g) 6") € D, 


then 
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a(F-y((F +) ge") 
= o(y(P-[(F*-¢) 6°") 
x(y(y(t + i))). 


If defined by this expression instead of (13), convolution would not 
necessarily have commuting factors; e.g., with y(-) and x(-) equal to a 
Dirac delta function 5(-) and a constant, respectively, y(a(o(t + #))) 
is not defined because for this choice of x, x(y(t + #)) is constant and not 
an element of Ca.*'t 


ay (e(t)) 


2.4 Generalized Hilbert Transforms 


In this subsection we extend the applicability of classical Hilbert 
transform properties and techniques to arbitrary distributions. Required 
initially are two lemmas relating to antiderivatives. 


Lemma 2: Corresponding to all antiderivatives of an element F-x € D 
the distribution limits 


lim™ | tan” vr ff Fa (14) 


exist for some N = 0. 


Proof: Set I, = (—a,«a) andl,, = (—e,e) with0 <a < e < », 
and construct a real, positive function n(f) € Ca satisfying the condi- 
tions 


ah Fed 
nif) = {4 a emo Soe 


It is convenient to consider first the same type of integral representation 
as was used in Lemma 1 [ef. (12)]; namely, there exist both a function 
y € C(.,) and an integer N = 0 such that 


Peale) = (=1)" [ve Nar (15) 


* The Dirac distribution is given formally by the equation dy) = ¢(0). 
+ Commutativity can be forced in such cases by defining the convolution ac- 
cording to the form 


a*y(e) = aySo(@)e(t + 1))) 


where t corresponds to the distribution of finite support and where {.€ C4 equals 
unity over an open interval containing this support and vanishes outside some 
finite interval. 
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for all g € Ca for which S[y] € I, . In agreement with (7) and (5) et 
seq., relation (15) merely asserts that 

av 
2 = ape¥) on de 


and that for all antiderivatives, 


[Pex =yv(f) + = Onf” on f,, (16) 


where constants a, are arbitrary. Since S[ng] C I., for all g € Ca, Eq. 
(16) can be written as 


[ Pex) = f Ee +3, anf" | nDe(Dar 


€2 


Therefore, by the Lebesgue convergence theorem” 


lim i F-a((tan™ df) ne) 
nN N 
(17) 
ae / (sen f) [W(N) + DL ons" Do Nas 
with 
a eee 
On the other hand, since 


ion Ee (1 - ne | 


for allj, k, andy € Ca, 
(tan Af) (1 — n)e > (7/2) (senf) (1-—n)e A>», 
and 
lim f F-x((tan Af) (1 ~ ne) = § ff F-2((sen (2 — ade). (18) 
Finally, adding limits (17) and (18) yields 
z [ F-xX(sen NA ~ De) +5 fend WW + Dans" he af - 
= lim [ F-x((tan Af)e). 
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As both terms on the left are elements of D, the distribution limits given 
by (14) exist. 


Lemma 8: Corresponding to element x, integer N, and all antiderivatives 
of Lemma, 2 the generalized functions 


oi lim? | ean PT) is fr. 2 


differ by only the additive combinations 


N-1 
aT >, aan! 6(f) 
n=0 


where 6(f) represents the nth order derivative of the Dirac function. Fur- 
thermore, 6(f)"” ts the highest-order Dirac component which can exist at 


Proof: From (19) and (7) there results 


p'-lim® - [ F-a((tan™ aoe) = % F-2((sgn f) (1 = ade) 
+(=D"5 J (sen NNEC Ne(MM as (20) 


+ (=F J (sen f) D ans"Inel as, 


the last, only nonunique term reducing to 
mw >, agn!(—1)* "(0)" = & DS annlb(oy*", 


Inspection of the two remaining distributions on the right of (20) shows, 
in addition, that 6(f)“” is the highest-order Dirac function possible at 
f = 0; for the support of the first does not include the origin, and the 
second represents the Nth derivative of an ordinary, sectionally con- 
tinuous function. 

The preceding two lemmas lead immediately to 


Definition 2: For any distribution x we define a distribution #, termed 
the generalized Hilbert transform of x, by the relation 


4(-) = i= FO. \ oa sim | (tan Df F. | 





oe (21) 
+S pane 
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where No designates the smallest integer for which Lemma 2 holds and 
where constants 8, are constrained so as to eliminate from F'- 4 (or to pre- 
scribe) all Dirac distributions at f = 0. 

As regards ordinary Hilbert transforms it is noted that if 


F-x(-) € In(T,,) (square-integrable), 
then 
lim™ (tan Af) F-x = (/2) (sen f)F-a. 
. 


Consequently, No = 0 and 
&(-) = —iF *-{(senf)F-2}, 
a formula which is in agreement with classical theory (cf. Ref. 15, pp. 
119-120). 
Denoting the linear mapping of (21) by H (i.e., H:D — D), we list 
a few of the more significant properties of generalized Hilbert transforms: 
(¢) H-H-x = —« provided there exist in F-x no Dirac components 
at f = 0. 
(17) H-zx is real provided z is real. 
(at) Sy[F-H-a] € SD[F-a]. 
These results follow directly from (20) and Definition 2. Of importance 
in single-sideband theory is the property given by 


Theorem 8: For any distribution x, 
SylF +(x + 14) ] © Lye 
and 
Sy[F- (a — i#)] © Le. 
That is, F- (x + 7#) and F-(x — t£) vanish on I_. and I+, respectively: 


Proof: Consider all g such that Sly] € I_,, ; then, 


(i? i (sen NUD + ons In Ne MPa 


= —p" | F-x(9e) = —F xno), 


and from (20) and (21) there obtains 
Fate) + iF -&(y) = F-x) — F-x((1 — n)e) — F-a(ne) = 0. 
Similarly, with Sly] C I,,,, F-2(y) — iF -#(y) = 0. 
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III. SINGLE-SIDEBAND ANGLE MODULATION (ssBOM) 


The notions and results of the previous section apply directly to signals 
classified as single-sideband angle-modulated, namely, time functions of 
the form 


ye(t) = exp [—&(t) ] cos [2af.t + a(t) }. 


It is the intent here to show that if the modulating signals x correspond 
to elements of the space 


So i {e12 ¢ B(1o), Lo = (—fo, fo); a |5| | < oo, x real}, 


functions y, characterize distributions, and have, as the term SSBOM 
suggests, amplitude spectra (Fourier transforms) which vanish on the 
interval (—f.,f.). We begin with three lemmas pertaining to exponen- 
tials and convolution. 


Lemma 4: Elements of the spaces 

Si ={yly =e",2 =a2+ 7é,2 € So}, 

S =f{vjvu=2,2z=a2+ i4,2€ S,N =O} 
are equivalent to generalized functions [cf. (4) et seq.]. 


Proof: Clearly, since Sp|F'-¢| € Sp[F-2x], both x and ¢ are bandlimited 
as well as bounded, and are, by Lemma 1, elements of Cg; hence, y is 
bounded on /,, and integrable over finite intervals, and 











. < 2 . y(t) 
| [veal s sw (a+ Het | [| On| a 2) 
We € Ci. 
This latter condition, however, implies that 
[ w@eloat > 0 (23) 


for gn — 0. Therefore, the left-hand integral of (22) constitutes a con- 
tinuous, linear functional on Cz, i.e., a distribution, and y is equivalent 
to a generalized function. Precisely the same argument applies to 
z"(N = 0), showing that this function is also bounded, integrable 
over finite intervals, and equivalent to a generalized function. 


Lemma 6: Forx € Sysandz=x+ 14 


N . n 
e* = lim yy sue : (24) 


Noo n= 
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Proof: Set 


tz 


é 


y 
N é 
_ y Gz)" 
alae > nt ° 
Then, by Darboux’s formula,!® 


Giz)" i: thz N 
— = 1 —rA)"dav 
y Yn N! 0 ent ya, 








and inasmuch as y and yy represent generalized functions (cf. Lemma 4), 





| ye) — ywe) | = is [y(t) — wo(Dle(Oae 
~ LF [ Hy, [ Fad Glee Wer at| 
< ri sup | 2""'e™ | ; : | e(t) | de 
S 55 (sup [2 |)" exp (sup | 41) 


‘[ le@ldi—s0 Wee Ge, 


a result corresponding to (24). 
Lemma 6: If two distributions g and h are such that 
Sol[F-g] & [0,fl, 
Sl[f’-h] € [0,frl, 
then 
Spf: (gh)] S [0A + fal. 
Proof: With respect to any ¢ € Ca for which S[y] © J_,, , set 
go(t) = F-hig(t + 2). 


As defined, go(t) = 0 for allt > 0; ie., 4,, & Nw] and Sly] G Iw 
Hence, by Theorems 1 and 2 


[F- (gh) te) = [(F-g)*(F-h)]te) 
= F-g(F-hig(t + t))) = F-g(eo(t)) = 0, 
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which yields 
Sy[F + (gh)] & 14... - (25) 


In a similar manner, consider any ¢ € Cz for which Sl¢] € [fi + fe, ©), 
and set 


gilt) = F-hig(t + t)). 


It follows that gi(¢) = 0 for allt € (—~,fi), ie., that Slo] € [fi , ©). 
Therefore, 


[F-(gh)\e) = F-glex) = 0, 
which yields 

Solf: (gh)] € (— ft + fal. (26) 
Conditions (25) and (26) prove that 

S,[F-(gh)] € [0,f1 + fel. 

The main result of this section is stated as 
Theorem 4: The amplitude spectra of generalized functions 
ye(t) exp [—&(t)] cos [2nfe + x(t)] x € So 

vanish on the interval (—f. , fc). 


Proof: Again, for x € Spandz = 2+ 74, Sp[F-z] © Ip and, by Theorem 
3, Sp[F-2z] C I4.. ; consequently, Sp[F-z] € [0,fo]. This condition com- 
bined with Lemmas 5 and 6 leads to 


Sp[F-e] 
a5 | Flim 3 | Lis im” = + F-2)"| (27) 
SU SiLF-[(é2) "ll C Taw. 


On the other hand, for 2 = x — 7# 
Sy[F-e"] © Le. (28) 


Finally, since F’-[e*’*‘y(t)] = o(f * f-.) for F-y = g, then (27) and 
(28) give 


So[F’ yell = Sy[F- (e%e? 70! zi ae aa) Cc [fe 20 ) U = 0 ,—fel, 
or, equivalently, ’'-y, = Oon (—f.,f.). 
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IV. SIGNAL RECOVERY FOR SSBOM REPRESENTATIONS 


In this section we treat the problem of reconstructing signals x € Sy 
from bandlimited versions of the associated SSBOM functions y, . 
Specifically, it is demonstrated that for a large subclass of So , knowledge 
of the amplitude spectrum of y, over any open interval containing 
[fe , fc + fo] proves sufficient to recover x up to an additive constant. As 
in the previous section, several lemmas involving the exponential e’’ are 
developed first. To collect notation, we set 


z=ar+1% x € So 
gts. Yo =1—e”, Ys = (1 — Qrit)"y, 
F™-[(\Ga) *h], Yn = FAP - ya’) #0n] 


Ya = 
Jn = F™. [Gxon], on = no(nf)(n = 1,2, ---) 
y=F-y Vy € D 
—f 
, _ 7. me ~\-2 _ Jé f>0 (29) 
k(f) = F-(1 — 2rit)™ = ‘ <0 


where \ and o are any frequency functions of Ca such that 
Mf) = . fel, Lh= Ofte, O<e< a 
0 Ff € [-efo + 2c] 
SoM cod, f oNar=1. 


In addition, let BV(J) and UL denote respectively the space of scalar 
functions of bounded variation on a closed interval J and the space of 
scalar functions satisfying a first-order uniform Lipschitz condition on 
some closed neighborhood of the origin. Finally, define the following sub- 
class of signal space > : 


Soo = {|x € So; [Ge — H(0")] € UL N BV(L.); (0) x 1. 
Lemma 7: Elements Ya , Yo, Ya» Yn Gn, k, A, and on are in D. 


Proof: This result follows immediately from the corresponding defini- 
tions and the test employed in Lemma 4 [ef. (22) and (23)]. 


Lemma 8: Solis] © 14... - 


Proof: Clearly, by (27) 
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SplGa] = Sols] U So[F-e"] € 4... (30) 
Also, taking any y € Ca for which S{g] © I_.., one obtains 
F"-[(F"-k) (F-g)]=0 Wf>0, 
or 
SF*-[ Fk) (Fg) I] & Bee. 
Hence, for all such ¢ 
F-ys(o) = ys(P-9) = yal(F 1k) (F-e)) 
F-y AF +[(F +k) (F-9)]) = 0, 
a condition implying that % = 0 on J_.. 


I 


Lemma 9: On I. , Hj = (AGja) #k = Gaand (F + ya’) ¥en = (AP Ya!) #6n = Gn - 


Proof: Take any ¢ € Ca for which Sly] G I. . With regard to the first 
relation 


SIF: ((F"-k) (F-9)]] & (— fo + 4, 
and by (30) 
F-y(e) = y(F-¢) = ya((F hk) (F-¢)) 
= Fey (F+[(F +k) (F-g)]) = Feya(QF:[(F™-k) (F-¢)]) 
= PF. [(P"-k)F (AF Ya) Ke) = [(AGa) *k] (¢). 


As to the second relation 
[on DoS + Dad = onlelS +)», 


Sl(2rif)onle(f + f))] S [-efo + 4, 
and by (30) and Theorem 1 
[(F-ya’) *on](y) = F-ya(2rifon(e(f + f))) 
= Feya(2nifd(f)one(f + f))) = [AF ya’) *on](g). 
Lemma 10: G1 € Lo(I,,) and Gn € Ca. 
Proof: On the basis of the Tonelli-Hobson theorem 
F™- (AF +ya)(e) = ya(F: (AF ™-9)) 
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= [mtr {f emacn | [fe ewmat] at ae 
= | alr) a | [ eon ja | ottai dr 


a [. i ya(7) (Fd) adr | g(t)dt Wee Ca 
Therefore, as indicated operationally, 
FU (AF+ya) = (FX) *Ya = [ ; ya(r) (FA) adr, 
and 
| FOP) |S (sup | ya(t) |) [ura la < @, 


which indicates that ya, ja € In(J,,). Lastly, since 
(AF ya’) € B([—efo + 2€]), 
FO-on € Ca; 
by Lemma 1 
F™-(AF ya!) € Cy, 
Yn = (Fon) [F°+(AF-yo’)] € Ca, 
Gn € Ca. 
Lemma 11: lim gn = 9; gn € Ca, and SolGnl G Ie . 


n>a 


Proof: As 
Him [ no(nprot sar = tim [oe (Lar = 000) We € Cr, 


then 


+. (D) 
lim’ ¢,. = 6, 


n 


and 


lim gate) = lim (gF*-on)(e) = lim on(F (gp) 


iF *-(geo)) = gF"-8(~) = ge) Wee Cu. 
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Furthermore, with o, € Ca, all three elements F*-on, gn, and gj, are 
also of Ca. Finally, from Lemmas 6 and 7 it follows that 


Sogn] = S,{g] U S[on] Cc I. 7 
Theorem 5: For x € Soo, elements g§, satisfy the functional equations 
f - - 
Gn(f) = ¥ | f Gn(f — f) dg.(f) — aC) | 


fel mos 1, 2.2, 


(31) 


where 
f - - 
iN =H -a0o+ fade rer 


9:(0) = 9(fo + «) = 0 
y= [1 -—H(07)]". 
Proof: According to definitions (29) 

Ya'F + on = (1 — wit) gnyo — Gn (32) 
with y» € Io(I,,), ya’ +n € Ca, and (1 — 2rit)g, € Ca. Expression 
(32) and Parseval’s relation combine to give 

(F-Ya')*on = Go*(Gn + Gn') —Grn fern, 


which by Lemmas 8 and 11 reduces to 
f 
(F-yeeon = [wD] oF —D — Zane —D at - aU. 


However, if considering f on J, , one need specify (F-ya’)*on and % on 
this interval only; consequently, Lemmas 9, 10, and 11 apply, yielding 


f 
m= [ a lar-D -ZaG-D|g-a tek. 


Clearly, in this equation, 7(0) and (for the development below) 
§a(fo + ¢€) can be set equal to zero without affecting the associated in- 
tegrals; hence, on integrating by parts, we get (31). To be noted in this 
theorem is that with respect to signal information, 7, , fa, and y derive 
solely from the bandlimited signal spectrum \g = AF’-e”; ie., 


Ya oe (A 7g rg) +k, 
Un re (2r1frAG) ¥en ) 
§o(0") = Ga(0"). 
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It is necessary to consider next some general properties of the integral 
operators in (31), viz., the mappings 


Baie | fog Naw - ain] fer. 


(33) 
v € C(I); n= 1,2,--: 
Relative to the domain of 7, , define the norm 
lvl = sup of] —-» € CCE) 
FET 
and metric 
p(vw) = |v — wll. vw € C(Ie). (34) 


Under this scheme the pair [C(I.), p:] = R. (more precisely, the pair 
consisting of C(J,) and the metric topology in C(J.)) forms a metric 
space which is complete.” We then have 


Theorem 6: Corresponding to any modulating signal x € Soo, operators 
T, constitute continuous mappings of the complete metric space R, into 
itself. 


Proof: To show that the range as well as the domain of 7, is in C(J.), 
take any v € C(I.) andset w = 7',-v foran arbitrary n. With « € So, 
% € BV(I.), and by Lemma 10, J, € Cz C C(I.); moreover, since 
ja = jf on I, (cf. Lemma 9), and since 


tf 
Gi = Ga — H(0") + jdf fel. 
g:(0) = fio +) = 0, 


function 7; € BV(I.). Therefore, by the Lebesgue convergence theorem 


Py [w(fe) — w(fi)] = im { vf ufo —f) — oi —f)) ag.(/) 


Ay [ v(fe — f) dgi(f) — vignCfe) — inten} = 0 Whi fe € Ie. 


That is, w € C(I,)(Tn: C(I.) — C(I.)). For establishing the con- 
tinuity of 7, , consider any two functions 2, , v2. € C(I,), and set w= 
Tn¥1, We = Tredv2, and vo = ve — for an arbitrary n. It follows from 
(33) and (34) that 
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i . s 
vf vf =) agslf) 





pc(W2, Wi) = sup 
fet, 


= | y | Vilfo + €) sup | vo(f) | 
f él, 
= | y | Vilfo + €) pe (v2, 01) 
where V;(f) signifies the total variation of 7; on the interval [0,f]. Con- 
sequently, p.(wWe, wi) — 0 if p.(ve, v1) — 0; 1.e., Zn is continuous. 
A basic result relating to the reconstruction of SSBOM signals can 
now be stated as 


Theorem 7: Corresponding to any modulating signal x € Soo, each of the 
equations 


In = Tn Gn Gn € C(L.); n= 1, 2, Pee 


has a unique solution given by 


Gn = Lim Ginn nym = 1, 2, - 
m-> oO 
7 Be, mM »~ 
Jn m1 = Th ‘Onl Wn,m 


Gn 1 0 ii € i. Vn 
where convergence ts uniform on I, . Furthermore, 


os = Im [FU lim™ «lim Gaim| = Im [lim lim: gun] 


d n ™m n m 


Gny Grim = 0 f € Ie Wn, m 


l 


where Im|-| indicates the imaginary part of the quantity in brackets. 


Proof: We employ here a standard fixed-point contraction-mapping 
theorem (cf. Ref. 17, p. 50): If p(-,-) and TY represent respectively a 
metric in a complete metric space R = [C,p] and a continuous mapping 
of R into itself, and if for some k and any two elements v,w € C 


p(T”-v,T*-w) S ap(v,w) a <1, 
then there exists a unique solution to the equation T'-vp = uv. Also, for 
an arbitrary element v € C this solution is given by 


v = lim T”-2, 


mo 


where convergence is taken relative to p. In view of Theorem 6 we need 
only demonstrate in the present proof that for some k and any two 
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elements v,w € C(I,) each mapping 7’, satisfies the contraction condi- 
tion 

| Tro — Tr*-w lle S allo — wl a<l. 


First, set 


I 


v— WwW 


M = sup | u(f)| 


SEL, 


With 7; € BV(I.) (cf. Theorem 6) 


vf wt =P ant | 


U 


[| w lle « 


|T,-v — T,:w| = 





Sf 
s Myf avi) =M\y|Vi) fe h, 


where V;(f) again denotes the total variation of 7; on [0,f]. However, 
inasmuch as 


[i> — H(0")] € ULN BV(L.), 
f 
Gi = f — (0) +/ He af #6 dg, 


9:(0) = §(fo + «) = 0, 
then 7; € UL MN BV(I,); hence, there exists a positive constant a» such 
that 


Vif) Saf fed. 
From this last condition it follows that 
| Trrv ae T,°© | = M | y | aof 
f f % 
IT,2-v — T,2-w| <M hy Pay | (ff) avlf) = M | y [ao [ vit) df 


oo M 2. 242 = 
M|y (ao | Faf = La Taos fee, 


IIA 


and, in general, 
M | y Fay" 
(k — 1)! 
M | ¥ |‘ao* 


rr. R\ (k~-1) 
= Far f Vif) d(f — f)' 


f 
| Patv — Peta [ G-p™P ary 


IIA 
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_M | Fao’ k= 
— Mater f jag a 
_ Mv ar’ f’ sa» 97 _ M |v Fao f* 
7 ele fF sales eet 


Therefore, for k sufficiently large 


M | vy |‘ao'(fo + €)* 


il =a<l, 


and 

| Too — T,*-w |e Sallo-wl. a@<1. 
The contraction principle as outlined then yields the main statement 
of the theorem, the last result being an immediate consequence of 
Lemma 11. 

Treated next are two important classes of modulating signals which 
prove to be contained in Soo : periodic functions of So and integrable 
(I,(7,,)) funetions of So having integrable Hilbert transforms. In the 
following development we represent the space of periodic functions by 
P and the intersection H-(L,(I,,.)) NM In(7,,) by L1(J,,). 


Theorem 8: Sy) NP € Soo. 
Proof: Elements « € So M P must have the form 


N . 
fons De bent 


n=—N 
Nf Ss fo; On = bei 


where 6 signifies the conjugate of b. Consequently, in accordance with 
Definition 2 


baer rset, 


Mz: 


2=2+2% = 


tl 
° 


n 


Putting 2, = 2 — bo, we obtain 


(1 — 2rit)™ ja — e) — °™ a (ie)" |, 


Yo 
or 


jf = (1 —e™)k — e™ > kei)" ) 
n=1 
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which, since 


Spl’: (t2p)"| S [nfp, ©) 
Sp La (iy)" CS [fo, ©), 


gives 
Sol — (1 — e)k] S [fp @), 
go(0") = (1 — e™) #1, 
and 
[G» — H(0*)] € ULN BV(L.). 
Theorem 9: So N £1(1,,) © Soo, and for all x in this intersection 


ot = Im [tim hal m= 1,2,->-, 


where 
Inga = (F+Xo) #[(Yo#F + Ao) Indl — “ (yal *+Xo) tel, 


h, = 0 oe ae 


o(f) = a 7 


Proof: Considering that « € So N £,(,,), z is a bounded element of 
Iy(I,,); hence, by Darboux’s formula 


1 
ly(t)| =|1—e*|<s j2| f tas | 2(t) | exp (sup |4|), (35) 
0 t€l,, 


and 


A 





Gof) — wh) S [fe —filexp (up |¢) 


sin (fo — fit | 
w(fo — fil " 


Ife filexp (sup 14) [ [2] a 


Vii, Se € ie 


NA 
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the latter condition indicating that %(0*) = %(0 ) = 0 (ef. Lemma 8) 
and that 7 € UL N BV(f,).* In order to prove the second part of the 
theorem, we note first that with « € So N Li(1,), 2 € Le(I,,) and 2 € 
Ly 1 C(Z,,); moreover, as Sp[2] G I., 2 € In(I,,) and g§ = 2rif(tz) € 
L, VL, N C(,,). Similarly, by (35), yo € Li N Lo(L,,) and G, € InN 
C(I,,). As a result, 


= | Geen | 


=| [ 00 Donat 


| Gn 


S sup | g(/) | I | o(f) | df S constant Wn, 
Fels 


' D) . ° ~ < 
lim” Gn = lim Gn = G, 
n 


n 


lin™ gj, = lim [(2rifGd)*on] = (QrifFad), 


and by the Lebesgue convergence theorem 


Ag fel. 


Il 


f 
§ = lim™ 7,-G, = | Gf — f) dg.(f) — 2niffar 


This expression asserts that @ is a fixed point of the mapping A: C( I> 
C(I.). Precisely the same arguments as were used in Theorems 6 and 


7 apply here to show that A is continuous with respect to norm || - ||. , 
and that 
“ = Im [lim “im m = 1,2,- 
Ama = A™-h, Wm 
hy = cee 
hn = 0 fé I, Vm 


where convergence is uniform on J, . On writing 7; as 


f _ 
j= Gardek + | (GardeB J 


* A similar calculation employing the Schwarz inequality shows that %(0T) = 0 
for z € L2(I,) also. Most square-integrable signals of practical interest satisfy 
the appropriate Lipschitz and bounded variation conditions, and are therefore 
contained in Soo . 
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ae To mids fo. f : 2 
oe [ gdel dF + [ el | [ Harel af’ | ij 

0 0 0 


f 
[aOng ser, 


I 


we get 
f 
fins = | inf — DieDAG) oF - Qwifer FT, 
or, more compactly, 


Sf 
jie A) i hnlf —F)GaCF)r0(F) dF — wif. f€ In. (36) 


(Since Sp[g] | [0, fo], Xo could be defined to have the same support. ) 
Taking the inverse Fourier transform of both sides of (36) yields the 
second part of the theorem. 


V. SUMMARY 


Definition 2 and Theorems 3 through 9, which constitute the principal 
results of the preceding sections, provide both a distribution-theoretic 
basis for the spectral representation of single-sideband angle-modulated 
carriers and a recurrence formulation for reconstructing most of the as- 
sociated modulating signals of practical interest. It is important to 
emphasize again that the approach employed in this development applies 
also to other modulation schemes. 
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APPENDIX 


Index of Symbols 


A —p. 2836 ea — p. 2824 
BJ) —p. 2817 F — p. 2816 
BV(I) — p. 2827 99m — p. 2827 
C*(I), Ca —p. 2813 H — p. 2823 
C; —p. 2817 | Sie De ae — p. 2813 


D —p. 2814 E — p. 2827 
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k(f) — p. 2827 Gq: — p. 2830 
ty — p. 2834 z — p. 2824 
N,No — p. 2814-2815 v —p. 2830 
P — p. 2834 6 — p. 2820 
S, Sp —p. 2815 r — p. 2828 
So, Sis So —p. 2824 pel, -) —p. 2832 
T. — p. 2831 on — p. 2827 
UL — p. 2827 pa —p. 2815 
x{-), x(-) —p. 2814 Sind — p. 2815 
x — p. 2823 L*Y —p. 2818 
x — p. 2816 lim™ — p. 2816 
Vi Us Ves Un — p. 2827 > —p. 2814 
Ye — p. 2824 l| v lle — p. 2831 
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On the Properties of Nonlinear Integral 
Equations That Arise in the Theory 
of Dynamical Systems 


By I. W. SANDBERG and V. E. BENES 
(Manuscript received May 4, 1964) 


This paper reports on some results concerning the properties of integral 
equations that govern the behavior of a large class of control systems or 
electrical networks containing linear time-invariant elements and an arbi- 
trary finite number of nonlinear time-varying elements. 

In particular, for networks containing linear time-invariant elements 
and an arbitrary finite number of positive-slope nonlinear resistors, it is 
proved, under reasonable conditions, that the response to a periodic excita- 
tion applied at t = O ts ultimately periodic with the same period as the 
excitation, regardless of the initial state of the network. 


I. NOTATION AND DEFINITIONS 


Let M denote an arbitrary matrix. We shall denote by M’, M*, and 
M~, respectively, the transpose, the complex-conjugate transpose, and 
the inverse of M. The positive square-root of the largest eigenvalue of 
M*M is denoted by A{J/}, and ly denotes the identity matrix of order 
N. 

The set of real, measurable N-vector-valued functions of the real 
variable ¢ defined on (— ~,« ) [ [0, )] is denoted by Sy [3Cy+], and 


tn = {rl se sey, [sat < «| 


Levy = {rlfe ates, f fifd< oh, 
The norm of f = (f:1, fe, --- , fw)’ € Len [Len4] is denoted by 


WA CWS lads 
it is defined by 
2839 
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ish= forse [isie= [rrae] 
and the norm of a linear transformation T defined on Loy [Lon4] is 
denoted by || T || [ |] T ll]. 
Let y ¢ (0,0), and, if f ¢ 3y, let 
fi=f for |t|Sy 
=0 for |t|>y; 
if f € Hy+, let 
fy =f for te [0,y] 
=0Q0 for t>-y. 
The sets Sy and &y, are defined as follows 
Sv ={f\|fesv, fyekw forO<y < o} 
v4 ={fl|fetn., fy e Lov, for0 < y < o}, 


With @ the set of N-vector-valued functions of ¢ which have the prop- 
erty that each component is uniformly bounded on its domain of defi- 
nition, let 


Lon = BN Kv, and Lavy = BN Tyg. 


Let 7 be a real positive constant and let 


Ben = {f1F € 8m, f® =fG@4+T) for allt, | f'fat < ot. 


Throughout the paper, & denotes a measurable, real N X N matrix- 
valued function of ¢ defined on (—~,), with elements {kmn} such 
that 


[ \bnn(@|at <2 (mn = 1,2, -+,N), 


and y[f(t),¢], with f ¢ 3Cy or f ¢ Hy, , denotes the N vector 
(Yilfi(d) él Welfe(t) t], +++, Wwlfw(t) £1)’ 


where 1(w,t), Yo(w,t), -+- ,Ww(w,t) are real-valued functions of the 

real variables w andi for —0o <w< © and —« <t< o such that 
(2) there exist real numbers a and 8 with the property that 

a < Vn(uy ’ t) ar Wn (we ; t) 


<6 (n=1,2,---,N) 
Wi — We 
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for allt ¢ (— «,«) and all real w; and wu, such that w, # we , and 

(it) Walw(t),t] is a measurable function of ¢ whenever w(t) is measura- 
ble (n = 1, 2,---,N). 

The symbol s denotes a scalar complex variable with c = Re[s] and 
w = Im[s]. 


II. INTRODUCTION 


Equations of the form 


g(t) = ft) + i k(t—r)vif(r),rldr, OSt< oe (1) 


in which f ¢ &y4 and y € Lavy , are frequently encountered in the study 
of physical systems containing linear time-invariant elements and an 
arbitrary finite number of time-varying nonlinear elements. Typically, 
f represents the system response and g takes into account both the 
independent energy sources and the initial conditions at ¢ = 0. For 
example, (1) governs the behavior of (a) an important type of control 
system containing linear time-invariant elements and an arbitrary finite 
number of memoryless time-varying nonlinear amplifiers, or (b) an im- 
portant type of electrical network containing linear time-invariant 
elements and an arbitrary finite number of time-varying nonlinear 
resistors. 
The related equation 


g(t) = s+ [ ke avli)aldr, 2 <t<o ©) 


is also often encountered. It arises when it is convenient for mathe- 
matical reasons to formulate a model of the system such that the re- 
sponse and excitation are defined for all t ¢ (— ©, ). In (2), usually 
g € Loy and only solutions belonging to Loy are of interest. 

One of the classic problems in the analysis of nonlinear physical 
systems is the determination of the properties of the response of a sys- 
tem, governed by an equation of the form (1), to a periodic input 
applied at ¢ = 0. Usually, the functions y,(w,t), which enter into the 
definition of y[- ,-], are independent of ¢; g can be written as g = gi + go 
in which gi € Ky 1 Love, go € Lowy, and go(t) > 0 ast— o; and (in 
accordance with the usual Volterra integral equation theory) it is known 
that there exists a solution f ¢ &y, . Ina great many cases of engineering 
interest it is simply assumed that there exists a unique response and 
that it is ultimately periodic with the period of the input. This is a 
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central assumption associated, for example, with the well-known de- 
scribing-function technique for the approximate determination of the 
steady-state response of nonlinear systems. 

In connection with the actual determination of the steady-state 
response, two common engineering assumptions are (in effect) that 
there exists a unique element of Lay M Ky , f, that satisfies 


a) =O +f ke-avif@lr, 2 <1<« 


and that the solution of (1), with g = g, + gz, approaches f(t) as 
t — o, the principal ideas evidently being that if the physical system is 
stable in some suitable sense, then the effect of the initial conditions at 
t = 0 should eventually “‘die out,” and, moreover, that the steady-state 
response of the system should be obtained “at once” if the periodic 
excitation is applied at “t = — 0.” 

The purpose of this paper is to report on some mathematical results 
concerning the properties of (1) and (2) that are pertinent, to a con- 
siderable extent, to engineering questions of the type discussed. In 
particular, as an application of our first theorem, we establish the 
mathematical validity of the engineering assumptions described above 
under what amount to reasonable conditions for the case in which 
k(-) is the matrix-valued weighting function of a passive network and 
v[-,-] represents N positive-slope nonlinear resistors (see Theorem 3 
and associated remarks). 

Under similar conditions, it is proved that an equation of the type 
(2) possesses at most one Lay solution. This type of result is of direct 
interest with regard to the qualitative nature of the solutions of (2), 
for if our conditions are met, and, as is often the case, (a) g in (2) is 
periodic with period 7, (b) the y,(w,t) are periodic in ¢ with period T, 
and (c) f is an Loy solution of (2), then [since f(¢ + 7’) is also a solu- 
tion of (2)] it is clear that f must be periodic with period T. 


Ill. RESULTS 


Theorem 1, below, focuses attention on a relation between the solu- 
tions of (1) and (2). This theorem is later used in order to obtain 
conditions under which the solution of (1) approaches a periodic steady 
state as i —> ©, when g approaches a periodic steady state ast > ~. 


Theorem 1: Let 


nt) = fit) + [xe — r)vlfilr),7]dr, —0 <t< 
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ho(t) = felt) + f k(t — r) vl fe(r),7]d7, 0O<t<o 


in which hy € By , fi € Ry 1 Ena, ho € Rwy, and fr € &v4. Suppose that 
(2) (fa — hy) € Lowy 
(at) [k(t — avr) ,rldr © Save 


and that, with 
K(s) -| k(the “dt for o =O, 
0 


(tit) det [ly + 3(a + B)K(s)] #0 for «20 
(wv) 3(8 — a) sup A{[ly + 3(a + B)K(iw)|"K(iw)} <1. 


oc w<00 


Then (fi — fe) € Levy, and, with 
p= sup A{[ly + 4(a + 8)K(iw)]} 


wom w<0o 


p= sup A{[ly + #(a@ + 8)K(iw)|"K(io)}, 


—o<mw<oo 


lf: — fe lle S oll — 2(6 — @) po} 


. 





hy — he — ie k(t — r)¥lfilr),rldr 





+ 








If, in addition to the hypotheses stated above, 
i): = tal) = ie k(t — r)¥lfi(r),rldr > 0 
ast— «, and 
[bal Pat < (mn = 1,2, ++), 


then [fi(t) — fo(t)] ~Oast— o. 
Our next result is concerned with the character of the change in the 
solution of (2) when g is altered by the addition of an element of Lon . 


Theorem 2: Let 


nl) =A + [kG — Duta), = -% <t< @ 


a(t) = felt) + [k(t — rvhfelr) lar, = 2 <t< @ 
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in which: hy, he € Ry; fi, fe € Lon ; and (hi — he) € Lay. Suppose that 
(2) 


[LF Vewa(e) 1a] 


and that, with 





at [| f Lboalo) el 


(mn = 1, 2,0 N) 





dt < @, 


K(Uiw) = i: i. k(te “dt, 
Giy-det (eds Ma SBR Ga) 0 for Gl w 
(101) 3(B — a) eal A{[lw + 3(a@ + B)K(iw)|K(iw)} <1 


—acw<oo 


Then (fi — fe) € Sen , and, with 
pi = sup A{ [lw + 3(a + 8) K (iw)]} 


wom w<eo 


p= sup Al[ly + + 3(a+ 8)K(iw)]'K (iw)}, 


aa 


lf: — fe || S mill — $(B — a)pol || ts — Pe J]. 


Observe that Theorem 2 implies that if (7), (77) and (277) are satisfied, 
then (2) possesses at most one Loy solution. 

As indicated earlier, in many cases of engineering interest g, in (1), 
can be written as g = gi + gz, in which g: ¢ Ky N Suny, go € Lowy, 
and g(t) ~ 0ast— o. In such cases it is often of considerable import- 
ance to determine whether f(t) approaches a steady-state response that 
is periodic with period 7 ast — «. Asa specific application of Theorem 
1, the following result is proved. 


Theorem 8: Let gi € Ky N Lows 5 92 € Love  G2(t) ~Oast— ©, n(w,t) = 
Vn(w,t + T) for all wandtandn = 1, 2,--:,N, andy[0,t] e Ky . Let 
f © y+ satisfy 


n®) +90 =f0+ f be-DI@rd, OSt< @. 
Suppose that 


* Wiese Nae 





dt << « (m,n = 1, 2,---, N) 





(ii) [ia LDenO Cake ae, Gan oem WN) 
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and that, with 
K(s) = | k(e“at for « 20, 
0 


(ait) det [ly + 3(a@ + B)K(s)] #0 for 20 
(iv) 3(B — a) ee A{[ly + 3(a@ + 8)K(iw)]"K(iw)} <1. 


Then there exists a unique f € Ky such that 
t 
alt) =f +f Re- Dv) rd,  —2 <t<o. 


Moreover, f € Son, (f — f) € Lanz, and 
f(t) —f()] 70 as too. 


With regard to the hypotheses of Theorems | and 3, it can be shown 

that* 
det [ly + 4(a + 8)K(s)] #0 for c=0 
and 
(8 — a) sup A{[ly + 3(a + B)K (iw)! "K(in)} <1 

provided that a = 0 and [K(iw) + K(iw)*] is nonnegative definite for 
all w. For this reason our results are particularly relevant to the theory 
of passive nonlinear electrical networks. 


IV. PROOFS 


4.1 Proof of Theorem 1 
Let K denote the bounded linear mapping of Loy. into itself defined 
by 


Kf = [ k(t —7r)f(r)dr, fe Lenz. 


With y an arbitrary positive number, and f an arbitrary element of 
nv, let P denote the mapping of 3Cy+ into itself defined by Pf = f,, 
and let yf denote the N-vector-valued function of t with values 


vif(t),{] for OSt < @. 


* The validity of the first assertion can be established with a standard argument 
involving the analyticity of K(s) for ¢ > 0. The second statement is a direct ex- 
tension of a result proved in Ref. 1. In particular, the greatest lower bound (over 
n) of the smallest eigenvalue of the term [ly + R,J7!* [ly + Ra + R,*]{lw + Ri), 
which appears in (7) of Ref. 1, can easily be shown to be positive. Thus, the con- 
clusion of Theorem 2 of Ref. 1 remains valid if the condition a > 0 is replaced 
by a 2 0. 
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Then from 
hy ay = [ kes Dunas 


a) SOL: i EZ SOO A =e, 


Ost< oa 
and the fact that 


p i =A ee: 


= P| k(t — r) (Why (7) ,7] _ Vi fey(7),7)) dr, 
we obtain 
hy = PII + 3(a + B)K\(fiy — fey) 
+ PK ivf, i Vfoy = 3(a a B) (fiy _ foy)} 


in which I denotes the identity operator on Loy4 and 


(4) 


hy = Iay = ny = Bf k(t — 2) (r) lar. 


In order to proceed we need the following result.” 


Lemma 1: Let det{In + 3(a + B)K(s)] 4 0 for «o = 0. Then 
[I + 3(a@ + B)K] possesses a bounded inverse on Lon +, and 


| 1 + 4(@ + B)K]" ||, _sup Milly + 3 (a + B)K (iw)\"} 
|| 1 + 4(a + 6)K]K ||, S ae A{[lv + 4(a@ + 6)K(iw)]"K (iw) }. 
Furthermore, 
Pil + 3(a+ 8)K]" = Pfl+}(a+6)K]'P forall y>0. 
Thus, since 
Pil + (a+ 6)K] “Pll + 3(a + 8)K] (fy — fey) = fry — Sov 5 
we obtain from (4) 
fy — fey = PIL + 4(a + B)KI"h, 
— PII + }(@ + B)K)'PK{¥f, — vf — $(a + 8B) (Sw — foy)}. 
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Using the fact that 
Il Yfiy — fey — 2(a@ + B) (fy — fey) Ile S (8 — &) Il fy — few Iles 
it follows that 
Il ftv — Sow + S || PI + 3 (a + 8)KJ A, [ls 
+ 3(8 — a) || PIE + (a + 8)K]'PK ||, 
| fin — Sey [I+ 
S || 1+ 4(@ + BKM [I+ ll by Ile 
+ 3(6 — a) || 1+ 3(@+ 8)KIK |], 
|| fis — fev ll+ - 
Using the inequalities of the lemma, 
\| fv — fev lle S pill — (6 — @) pol || Ay Il. 
pill — $(8 — a)p2)* (5) 


IIA 


. 











hy — he — . : k(t — r)Wlfi(r),rldr 





+ 


for all y > 0. Therefore, (fi — fo) € Ln4 and || fi: — fo || possesses the 
upper bound stated in the theorem. 
We now show that (fi — fo) € Lov+, 


0 
hi(t) — holt) — / k(t — r)vlfilr) 7Jldr7 ~ 0 as t>o~, (6) 
and 
[O [bel Pat <0 (mn =1%-,N) (1) 
0 
imply that [f:(t) — fe(t)] ~ Oast— o. 
Assume that (fi — fe) € ov, and that (6) and (7) hold. Then, from 
(3) it is evident that [fi(¢) — fo(t)] ~ Oast— © if 


[ kG - DUG) a1 — HAW dr +0 as te. (8) 


To prove that (8) is satisfied, observe first that (f1 — fe) ¢ Ley+ implies 
that (vfi — Wfe) € Lav . Thus it suffices to show that if g €¢ Loy, , then 


t 
[ kG — alr)ar + 0 as t{— ©. 
0 
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Let 
ive) 
. : —tuwt 
G(iw) = Lim. [ gle "dt, g € Lony. 


Then, in view of assumption (7), the modulus of any element of the 
N-vector K(tw)G(iw) is integrable on the w-set (— «©, ), and hence 
by the Riemann-Lebesgue lemma 


ns ; K (iw) Giw) deo, 
21 —o 
which is equal to 
t 
[ ka - Do@ar, 
0 
approaches zero as t — «. This completes the proof of Theorem 1. 


4.2 Proof of Theorem 2 
In this section, K denotes the bounded linear mapping of Loy into 
itself defined by 
Kp= | be a)flr)dr, Fe Sow. 


With y an arbitrary positive number and f an arbitrary element of 
Ky , P denotes the mapping of 3Cy into itself defined by Pf = f,, and 
yf denotes the N-vector-valued function of ¢ with values 


Vif(t),4] for -—2 <t< o, 
From the fact that 


hi(t) — helt) = fit) — felt) 


a 9 
+ [bE = WAG) A] — HG) Dar, 


we obtain 


hy = fiy — fou + Kf — Vey) 
= (I+ 3(a ae B)K)\(fiy = foy) 


+ Kivfiy — fey — 3(a + B) Sw — fov)}; (11) 


in which I denotes the identity operator on Loy , and 


(10) 
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y(t) = Iay(t) — Iaylt) + f(t — 2) lfia®) 1 — Vine) Dar 


— Pf ke = 2) (the) — vale) Dar. 


At this point we need”’t 

Lemma 2: If det [ly + 4(a@ + B)K(iw)] ¥ 0 for all w, then 
[I + 3(@ + 8)K] 

possesses a bounded inverse on Loy , and 


| E+ 3(a + 8)KT* || S_ sup Affe + 3(a + B)K(iw)T} 
| 2+ 3(a + AKT K || S sup Ally + 2(a + B)K (tw)! "K (iw)}, 


Thus from (11), 
fy — fay 
= —[[+3(a + 6)K] ‘K(f — vfy — (a + B) (fy — fev)} 
+ [I+ 3(a + B)KI hy . 
Using the fact that 
Il ¥fty — You — 2(a@ + B) (fy — fox) || S 3(8 — &) || fv — fa II, 
we have 
Il fy — fey | S$ 3(8 — @) || I+ 3(@+ 8)K)K || - |l fy — fey Il 
+ |] E+ 3(@ + 8)KI™ |] - |} hy Ul. 
In view of the inequalities of the lemma, 
Il fy — Soy | S mill — 3(8 — @)py]™ |] hy |. (12) 


Assume now that there exists a constant c such that || A, || < c for all 
y > 0. Then, from (12), it is clear that (f1 — fo) € Lay . This implies 
that (Wfi — Wfe) ¢ Lay. Hence, (9) can be written as 


hi — he = fi — fe + KW — vf), 


from which it follows, by essentially the same argument as that used to 
obtain (12) from (10), that 


¢ With no more than a reinterpretation of the functions involved, the proofs of 
the inequalities of Lemma 1 suffice to establish the inequalities of Lemma 2. 
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Il f: — fo ll S ell — $(8 — @) pp} |] Ar — Fe |. 
Therefore to complete the proof of Theorem 2, it suffices to prove 


Lemma 3: If (hi — he) € Sen, fi, fo € Lon, and assumption (7) of The- 
orem 2 ts satisfied, then there exists a constant c such that || h, || S c for all 
y > 0. 


4.2.1 Proof of Lemma 3 
Let g = (%@,@ 5°": , ay)’ = (Wh — vf), 
6(t) = 1 for |t| sy 
=0 for |t|>~¥y, 


and 


w= (ui yuey yun) = fo bE = r)l0(r) — oD \a(r)ar. 


Then, since (hi — he) € Loy , it is sufficient to prove that there exists a 
constant ¢; such that || u || S c for all y > 0. Further, since 


[uh = > fo len Pat 

















= [DL bnalt — 200) — oCO)laa( ara 

<N 2 3) [| Fe bane = 2)10(7) = 0)lau(r)ar a 

< wD = CE Pech) OG) =9@ del ae 
in which 


n = max sup | gn(t) |’, 


it suffices to show that there exists a constant c. such that for all y > 0 


fe 


2 


[baal eV GC) oO ae) ae 





(m,n = 1,2,---,N). 
Using the fact that 
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[LB mn(2) | L(t — 2) = 0(8) | ar 


IIA 
< 


= | | kun(r) |de + [| malt) ler for [| 
t+y —00 


tty 
a | | Kmn (7) | dr for | ¢ | > Y; 
t—y 
it is a simple matter to verify that 
ioe) ie) 2 
[LF Venn (t = 2) | 0G) = 0 | ar] at 


2 [| J \tnnls) [ar 
am 
+f 


from which it is evident that our assumptions imply that there exists 
a c. with the required property. This proves the lemma and completes 
the proof of Theorem 2. 








2 
dx 








Gee oP iE: xa) (ae 





2 
dx 








J Van (7) | ar 


2 
dx, 





at2y 
[ (Banle) | ar 





Remark: 


Assumption (2) of Theorem 2 is satisfied if 
[| than (@) [dt < oO, (n,m = 1,2,---,N), 
for then the (bounded ) functions 
[ | kmn(x) | da and a | Reveney | ae 
are integrable on (0, ) and (— «,0), respectively. 


4.3 Proof of Theorem 3 
We need two lemmas. 


Lemma 4: Let ~[-,-] satisfy the conditions of Theorem 3, gi € Ky , and 


K(iw) = i k(t)e **dt. 
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Suppose that, with It the set of integers, 


(i) det | 1s + 4a+ B)K (=) ~0 for ne 


(i) 4B — a) sup A {hs + He -+ aK (SE) | x (=) rae 


Then there exists a unique f € Ky such that 
t 
nt) =f + [ Ke-ei@)rld, © <t<e, 


Proof of Lemma 4: 


Theorem 4 of Ref. 1 and the remarks relating to its proof imply 
that the conclusion of Lemma 4 is valid if the hypotheses of the lemma 
are satisfied and the condition 


sup A ths + $(a + B)K (a) } < © (13) 


is met. However, since every element of K(i2mn/T) approaches zero 
as |n|— ©, assumption (7) of Lemma 1 implies that 





inf 
ney 


det [ts + 3(a+p)K (32) | > 0, 





Therefore, in view of the fact that the elements of K(72rn/T') are 
uniformly bounded for n ¢ 9, it follows that (13) is satisfied. This 
proves the lemma. 


Lemma 6: Let p[-,-] satisfy the conditions of Theorem 3, let fe Ky, and 
suppose that assumption (it) of Theorem 8 ts satisfied. Then 


is k(t — r)Wlf(r),7I]dr € Lon . 


Proof of Lemma 6: 
Let g(t) = v{f(t),t], and 


Uu= (W,m,°° -Un) = [x — r)q(r)dr. 


Then q ¢ Ky, and 
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frin(®) Sf [ant — 2) [of gale) | 


IA 


Df ban Ce) [+] ant = 2) de 


Furthermore, 





eee net eee ee 





_ dn (t a T) : 


l+fer a 








< [1+ r)kun(s)Far | 


and the last integral can be bounded as follows 


“la@— Py & fo" |e — oP 
i 1l+pr We a oe 


(: 2 p» (my) Me | an(t) fF de. 


Thus, the w(t) are uniformly bounded on (— ©, ), which proves the 


lemma. 
Theorem 3 follows at once from Lemmas 4 and 5, Theorem 1, and 


the fact that assumption (7) of Theorem 3 and f ¢ Loy imply that 














IIA 


i CRE Le eee ee 
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Applications of a Theorem of Dubrovskii 
to the Periodic Responses of 
Nonlinear Systems 


By V. E. BENES and I. W. SANDBERG 
(Manuscript received May 14, 1964) 


Dubrovskii’s theorem on completely continuous operators that are asymp- 
totic to zero 1s applied to the study of the existence and uniqueness of periodic 
responses of nonlinear systems to periodic driving signals. Examples of 
nonexistence and nonuniqueness are given, a relationship between non- 
uniqueness and subharmonics is noted, and some general existence theorems 
are proven, giving estimates on the magnitudes of the harmonics. 


I. INTRODUCTION 
In 1939 V. M. Dubrovskii! proved the following result: 


Theorem 1: If A is a completely continuous operator which maps a Banach 
space X into itself, with the property that 


in |All 


=——-_ = 0 reX 
Iz || 2 || ; 


then for each scalar \ and y ¢ X, the equation x = y + Az has at least 
one solution xe X. 

Dubrovskii’s theorem was stated in the long review article of M. A. 
Krasnoselskii’ on problems of nonlinear analysis, but except for a recent 
application,’ it seems to have gone largely unnoticed. It is the purpose 
of this paper to indicate some applications of the basic idea in the 
theorem to integral equations (and systems thereof) that arise in the 
study of nonlinear electrical networks and automatic control systems. 

The applications to be made all center around the existence and 
uniqueness of periodic responses of nonlinear systems to periodic driving 
signals. These properties of the equations governing nonlinear systems 
are frequently taken for granted. The fact is, though, that these are 
by no means universal properties of such equations, as simple examples 
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(to be given) will show. Often, the nonexistence of periodic responses 
is related to instability of the nonlinear system, while their lack of 
uniqueness is closely connected with the possibility of responses with 
subharmonic components. Thus it is important, in control and circuit 
theory, to be able to distinguish nonlinear equations that have unique 
periodic solutions for periodic inputs from those that possess several 
such solutions. With the aid of the idea underlying Dubrovskii’s theorem, 
we examine this problem in the present paper for systems described by 
the nonlinear integral equation 


a) = yl) + [ke - wy e(w),u) du, (1) 


(and by a vector analog thereof,) where y(-) is an input, k(-) is an 
integrable (Z,) impulse response of a linear system, and ¥(-,-) repre- 
sents a periodically time-varying nonlinear element. Periodic solutions 
of (1) have already been considered in previous work of one of the 
authors;* almost periodic solutions of (1) have been studied in previous 
joint work’ of the authors. In both these papers a basically different 
assumption about the growth of the element ¥(-,-) (from that to be 
made here) was used. 


II. SUMMARY 


A discussion of the abstract Banach space setting for Dubrovskii’s 
theorem appears in Section III. It includes a quick proof of the theorem 
from Schauder’s fixed-point principle. There follows in Section IV an 
account of mathematical preliminaries, assumptions, definitions, etc., 
requisite for our remarks about (1). These remarks begin, in Section V, 
with a simple example showing that (1) may have no periodic solution 
and continue in Section VI with an existence theorem, for periodic 
solutions of (1), based on the principle of Dubrovskii’s theorem. In 
Section VII we apply this result in discussing an example of nonunique- 
ness due to existence of subharmonic solutions. In Section VIII it is 
shown how the bound on the norm of the solutions obtained in Section 
VII can be improved. In Section IX, finally, a vector analog of the 
existence theorem of Section VII is stated and its proof sketched. 


III. BACKGROUND DISCUSSION 


We recall® that an operator A taking one Banach space into another 
is termed completely continuous if and only if it is continuous and carries 
every bounded set into a compact one. Dubrovskii’s theorem for such 
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operators is a straightforward consequence of Schauder’s fixed-point 
principle:’ Let S be a bounded, closed, convex set of a Banach space X. 
Let A be a continuous transformation of S into a compact subset of 
itself. Then there exists at least one point x e¢ S such that « = Az. 

An operator A satisfying Dubrovskii’s condition 


im LAtll 


0 
zoo || z|| 





is said to be asymptotically close to zero; explicitly, the condition is that 
for every « > 0 there is anr such that || z || = r implies || Az || < «|| 2 || . 
To prove Dubrovskii’s theorem we seek a closed ball, of radius R to be 
determined, that is mapped into itself by the (completely continuous) 
operator G defined by 


Gr = y+ Ax 


with \ and y ¢ X fixed. Let « be a number such that 0 < |A|{e« < 1, 
and pick (by Dubrovskii’s condition) an r > 0 such that || 2 || = r 
implies || Az || < «|| 2 || . If now s is a positive number such that 


> lvl 


then forr S || ]| Ss 
| Gz |] Syl] +1A]- | Ae] 
ly ll + [Alelall 


s. 


IIA 


IA 


Since A is completely continuous, the set 
(Aa: || x]] S 7} 


is compact. Thus the continuous function || Az || defined on { |] x || S 7} 
is bounded. If #2 is chosen as 


= Il y || 
= max { Hi yi) + 11 sup lh Act} 
then || x || < R implies || Gz || S R. The closed ball of radius R is convex, 
and the existence of a fixed point of G in the ball follows from Schauder’s 
fixed-point principle. To establish the result for a particular value of \ 
it is not necessary that A be asymptotically close to zero; clearly, it 
suffices that there be « such that 0 < « < || “andr such that |] x || > r 
implies || Az || < e|| a |]. 
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IV. PRELIMINARIES 


We shall be concerned throughout with the case in which the functions 
x(-) and y(-) of interest are periodic and square-integrable over a 
period. By L.(—T,T) we denote the Banach space of all functions x(- ) 
of period 27 that are real-valued, measurable on [— 7,7], and for which 


the norm 
Jz] = (4 [ier a) 


is finite. According to standard results in the theory of Fourier series, 
such a function is represented in the mean by its Fourier series 


N 
a(t) = lim. >> ane"? 


N+o m=—N 


with Fourier coefficients 
ln = oe [ a(the i!” dt —-o0 <i m< o, 
2T J_r : 


The norm of «(-) and its Fourier coefficients are related by the Parseval 
identity 


© 


(2) Sie 

We shall need the following two facts from the theory of Fourier 
series: (1) If 2(-), w(-) e¢ L.(—T7,T), with respective Fourier coeffi- 
cients {z,}, {wa}, then 


T 
ot | a(t — u)w(u) du = Do zawne™™"”, 
—T n 


the series on the right converging absolutely and uniformly; (2) the 
Fourier coefficients of 2(--+e) are {e7"*/7z,}. 
The notation 


1 T 
leh =saf le {ae 
is used occasionally. 
For a periodic function 2(-) ¢ L.(—T,T) we define the functional 


1 T 


u(z,€) = iF bat +e) — z(t) | dt, 
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proportional to an “integral modulus of continuity,” and we remark 
that one of the usual arguments for the Riemann-Lebesgue lemma gives, 
for n ~ 0, the inequality 


1 f* 
an | abe rut a 
—T 


= la (ie 2(t[1 — een a 


| @n | = 





so i i he Pa eainy, 


We shall make two assumptions about the nonlinear element y(- ,-), 
one about its growth and one about its continuity: 

(a) there is a function \(-) nondecreasing on [0,«) such that for 
all v, ¢ 


| ¥(v,t) | SAC] o]), (2) 


(b) the function y(- ,-) is continuous in the first variable uniformly 
in both variables. Then its modulus of continuity w(-), defined by 


w(6) = sup|¥(u,t) — ¥(v,t)| for |[u—o| S64, (3) 


is a continuous monotone function, and approaches zero with 6 — 0. 
When y(v, -), considered as a function of t, has a modulus of continuity 
wo(+), so that 


| ¥(v,t + €) — (v,t) | S wo(e) 


for all v and ¢, we set 


_ JO n=0 
fn \en(T/n) nO. 


Jensen’s inequality for a concave function ¢(-) reads 
b b 
[ s@)p@) az\— [ o(s@))p (a) ax 
a ge 
[ @) ae [ p@ ae 


where ¢(-) is concave in an interval containing the range of f(-) over 
[a,b], p(x) = 0, p ¥ O, and all the integrals in question exist. 

We now return to k(-) in (1). Since k(-) belongs to Z,, it has a 
bounded Fourier transform 


(4) 


0) 
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K(a) = nyt [eR at. 
The convolution operator K on L.(—T,T) defined by 
Ka(t) = [a(t — wu) du 
is described in terms of its effect on Fourier coefficients by the identity 


(K2)m = (2n)K (7) am, 


and takes L.(—7,T) into itself continuously. 


V. NONEXISTENCE OF SOLUTIONS 


It is easy to see that in some very simple cases (1) has no periodic 
solution. An example is furnished by 


Remark 1: If ~(v,t) = v for all v and ¢, and if, for some integer n, both 
the nth Fourier coefficient y, of y(-) does not vanish and 


hn = (20)'K ("") = 1, 


then (1) has no periodic solution «(-) belonging to L, (—T,T). For if 
there were such a solution, the left side of (1) would have nth Fourier 
coefficient x, , while the right-side would have yn + an ¥ tn. 


VI. EXISTENCE OF SOLUTIONS 


Theorem 2: If \(-) and w(-) are concave, y(-) ¢ Ie(—T,T), 
2 
(FI <- 


r= {yl + a(r) (5) 


has a positive solution r, then there exists a solution x(-) of (1), with 
pertod 2T, and such that 


lel sr 


[tm | Slum | + 2n)K (2) x6) 


where tm, Ym are the respective mth Fourier coefficients of x, y. 


ie] 


re =m >, 


Ma=—eoO 





and if the scalar equation 
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Proof: In the complex sequence space l. of Fourier coefficients, isometric 
to L.(—T,T), consider the set § of sequences x = {a,, -~» <n < «} 
such that, with the overbar denoting the complex conjugate, 


L—m = Lm 


Lum l+ | (% m) | XG). 


IIA 


| 2m | 


By Minkowsk?’s inequality, x ¢ § implies 
| «|| = (QE | tm |” silyl tac) = 


The set $ is compact, being an analog of the Hilbert cube or parallelotope. 
It is easily verified that $ is convex. 

Now let x(-) ¢ Zn(—T,T), and consider the magnitudes of the 
Fourier coefficients of the function w(-) defined by 


w(t) = o(x(t),t) = p(a(t + 27), t+ 27). 
We find 


|wn| = on | wlatonern a < ay | ¥(a(Z),t)| de 


sn | dC la(u)|) a a 


(af | a(u)| au) 
(|| @ [)), 


where the second inequality follows from the fact that (-) bounds 
the growth of ¥(- ,¢), the third inequality follows from the concavity 
of \(-) by the Jensen inequality (4), and the fourth inequality follows 
from Schwarz’s and the monotone nature of A(-). Hence if || z || S 7, 
then | ym + (Kw)m| S |ym| + (22)? K(mx/T) d(r), and it follows 
that the operator A defined on L,(—7,7) by 


IIA 


IA 


lA 


Aa(t) = y() +f ke wyle(w)au, |e] <7 


maps the ball |/ z || S r into the compact, convex, isometric image of §$, 
that is, into a compact, convex subset of itself. Continuity of A on the 
image follows from that of K and from the inequality 
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ce ie | w(xr(u),u) — o(ar(u),u)| du < w(|| a — 2 I), 


provable by the same method as (6). Existence of a fixed point of A 
in the isometric image of S$ follows from Schauder’s theorem. 

We remark that if \(w) = o(w) as u > o, then a solution 7 to the 
scalar equation (5) always exists. This occurs, for example, if 


A(u) = Bu* 6B > 0, 05a <1. 


VII. NONUNIQUENESS AND SUBHARMONICS 


It is known that a solution of (1) may have Fourier components, 
called ‘“‘subharmonics,” of period greater than 27. Our purpose is to 
remark that if this occurs, then (1) does not have a unique solution, 
and in fact, the greater the period of the subharmonic, the more distinct 
solutions exist. We start with a simple example: Let 7 = 7/2, set 


(ut) = sgn u- |u|?) 
y(t) = 3 — 3 cos Bi)? (7) 


let 
a(t) = 3+ 4sint — 4 cos 2t, 
and for K(-) take any Fourier transform of an integrable function 


with K(0) = 0 and K(1) = 4(2r)*. For example, the fourth-order 
filter 


(2) 716 (iw)? 
(1 + iw) * 
will do. Actually, since we need to prescribe only the two parameters 

K(0) and K(1), the second-order filter 
(Qr) tw 
(tw)? + ftw + 1 


K(w) = 


K(w) = (8) 
would do as well. 

That «(-) as defined is a periodic solution of (1) of period 27 can 
be verified from the identity 


2+sint = (2+ 4sint — 3 cos 2t)’. 
This example, in which the solution x(-) contains the subharmonic 
component 4 sin t, is adapted from Hughes,* and has been used earlier’ 
by the authors merely to illustrate the real possibility of subharmonics 
in relatively simple systems. 
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Now since the input y(-) has period z, while the response x(-) has 
period 27, it can be seen that by shifting x(-) by -t7, that is, by changing 
the sign of (all) the odd components of z(-), another solution of (1) 
for this ¥(-) and y(-) is obtained, because 


k(u)du. 





vito) =y() +f sgnelt-e x — w) later — w) 


Thus, there are at least two solutions of (1) for this example; the two 
we have identified so far differ only in phase. As an application of 
Theorem 2 we show that there is at least one more solution, one that 
has period +. The following lemma establishes a Hélder condition for 
the nonlinearity of the example: 


Lemma 1: If 
¥(v) = sgnvio|? 
then for all v and « 
| ¥(v +e) — vr) | Sel’. 
Proof: First suppose that sgn (v + ¢) ¥ sgnv. Then|e| = |v +e] + 
|v |, and concavity gives, by Jensen’s theorem, 


lv +e) —vv)| =lv+el+]of 
go (tel loll aye 


If sgn (v + €) = sgn», there is no loss of generality in supposing that 
v+e> v2 0, because y(-) is odd. Then using concavity again 


vv) 2- 7m vv + 6) 


€ 


vu+e 





IV 


y(e) = v(v + e). 





Hence in this case 
OS vtec.) —Wv) S We) = le]*. 


A direct application of Theorem 2 shows that (1) for the example 
(7), (8) has a solution of period 7. The scalar equation 


ly] tet =r 


is appropriate, and has a positive root r. 
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The example just discussed illustrates the following general principle 
regarding subharmonics: 


Theorem 8: If (1) has a solution x(-) with (minimal) period 2nT,n > 1, 
then each of the functions 


x(t + 2kT) k=1,---,n 
ts a (distinct) solution of (1). 


Proof: Since y(-), and the time-dependence of y(-,-) have period 27, 
we have 


a(t + 2kT) = y(t) + le W(a(t + 2kT — u),t + 2kT — u)k(u)du 
= y(t) + is W(a(t + 2kT — u),t — u)k(u)du. 


VITI. CLOSER BOUNDS ON FOURIER COEFFICIENTS 


By a more penetrating analysis it is possible to strengthen the bounds 
on the norm and on the Fourier coefficients given by Theorem 2. For 
example, the inequality (6) merely establishes a uniform bound X(r) 
for all Fourier coefficients of functions 


w(t) = w(x(t),é) 

for || x |] S r. However, since the argument for (6) shows that w(-) is 
absolutely integrable over a period, its Fourier coefficients actually go 
to zero at infinity, and it should be possible substantially to improve 
the estimate (6). This can be done with the help of the quantities 
{Qm, —%2% <m < }, and the functional p», defined in Section IV. 

Throughout this section, it is assumed that ¥(v,-) has the modulus 
of continuity wo(-) as a function of ¢, and that 


iS K (™2) 


It follows from (9) that there is a function h(-) ¢ L2.(—T,7) such that 
for any z(-) e In(—T,T) 


of [ h(t — u)x(u)du = iz k(t — u)x(u)du; 


eae (9) 








the Fourier coefficients of h(-) are 


hin = (2n)'K (™2), —-x <i m< o, 
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For each positive number s, and m = 0, we define 
An(s) = u(y,T/m) + X(s)u(h,T/m). 


Since for 7(-) ¢ Le(—T,T), u(2,e) — 0 as ¢ — 0, the numbers a,,(s) 
are bounded in m =~ 0 for each fixed s. By the Riesz-Fischer theorem 
there is for each s > 0 a function u,(-) ¢ Le(—7,T) whose Fourier 
coefficients are 


ils) = 


with (ef. Section IV) 


{@(Gm(S)) ++ dm}, m #0 


2|hm| 
| A(s) m = 0. 


| 
| ho 


_ JO n=0 
aS wo( T'/n) n £0. 


Theorem 4: Let \(-), w(-) be concave, and let r be a positive number 
satisfying the inequality 


lly ll + lull sr. 
Then there exists a solution x(-) ¢ L2o(—T,T) of (1), such that 


| x [| S71, 
u(x, T'/m) = Am(T), m #0 
Lee || we | Ete (P| all m. 


Proof: Let the operator A be defined on the ball { || x || < r} in L,.(-—7,7) 
by 


Ax) = yl) + [Rl — wy(olw)urau 


=) +55 f W(t = wou) udu 


The argument of Theorem 2 shows that A maps { || z || < 7} con- 
tinuously into Z.(—7,7). Further, by Fubini’s theorem and the con- 
cavity of A(-), 


ap is [. [h(t + — u) — A(t — w)|-\b(a(u),u)| du 


= nlh)-sn | Le o(w),u)| du 
< u(he)A(|| |), 


lA 
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and we find that |] «|| < r implies p(Az,T/m) S an(r) for m # 0. 
Moreover 
| Wala t €)u te) — (a w) re) | S of |e(u+ €) — 2(u) |) + ole), 
so that the concavity of w(-) implies 
2u(H(x),€-) S w(u(z,e)) + wole). 
It follows that || x || S r implies 
| (At) m| S [ym] + | hm | {olam(r)) + dm}, m0 
and also 
| (Ax)o| S| yol| + | hol Cr). 
Let S$ be the compact set of J, sequences 
x= {%,-7 <n< o} 
such that 
Ca = Tins 
[em | S| ym] + | Um(r) |. 


It can be seen that A maps the ball { || x || S$ 7} into the isometric 
image in Le(—T,T’) of 8. This image is compact and convex, and The- 
orem 4 follows from Schauder’s fixed-point principle, as did Theorem 2. 


Theorem 5: Let \(-),w(-) be concave, let y(-) ¢ In(—T,T), and let 
there exist a positive number r and a real bounded sequence b = {bm ,m ¥ 0} 
satisfying the inequalities 


ly th + Wh lhaCr) Sr 





uly,P/m) + 2, 





sin as 

2m 

Then there exists a solution z(-) ¢ L.(—T,T) of (1) such that 
lm | S| ym| +4 | hm |{o(be) + an}, m0. 


Il «Il 


IA 


IIA 


r 


Proof: Let R be the compact, convex subset of I2(—T7,T) consisting of 
functions 2(- ) such that 


Il 2 Ih 


<r 
2Qu(z,T/m) S ba; m £0. 
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Let x(-) € R, and let {y,, —2© <n < o} be the Fourier coefficients 
of the function w(-) defined by 
w(t) = ¥(2(é),t), all ¢. 
Then the concavity of \(-) implies that 


lyol S (an [lx olae) < X(r) 


and that of w(-) implies that for m # 0 
ae 
2T 
w(Qu(x,2/m)) + dn. 


[Yn | Sar [wll ee + P/m) — a(t)|)dt + dm 


IIA 


IIA 


Now 
Ax(it + €) — Az(t) 
T 
=yt +) -yO +] Wee = Ww ~ AO = WGW,udu 
and the second term on the right is 


Dy dine” = 1)y,e""", 
n 


the series converging absolutely and uniformly to a quantity of modulus 
at most 


2 nal 
nx¥0 


Hence, with « = 7'/m, m # 0, 





in| iva. 


2u(Az,T/m) & b 


At the same time 


male [Aol dt sz, | “lula tae [ [he— 1) | [vol a) lara 


IIA 


ly lh + | A IAG) 


r. 


lA 


Thus Az(-) belongs to R. The result follows by Schauder’s theorem, 
as before. 
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Remark: Let w(-) be concave, with w(wu) = o(u) as u > oo. 
Let {2, ,n ~ 0} belong to I, , and Ict {h, ,n # 0} belong to 1, ly. Then 
there exists a minimal bounded sequence {b, , n # 0} satisfying 


| zn | + 2y 


The sequence {b,} is minimal in the sense that its components are less 
than or equal to the corresponding components of any other sequence 
satisfying (10). 

Let B be the set of sequences v satisfying (10). To prove B is non- 
empty, let 


ml im | (bm) S n#0. (10) 


sin 5 ——~ 








0 n=0 

ey 

m~+0 

and let r satisfy || z || + || « |] o(7) Sr. Define w = {w, ,n # 0} by 
Wn = | 2r| + urw(r) S 


Un = 





sin || in| n € 0, 


Then 


sin | Rm | o(Wm) S 


Jén| + 2y 


so that w e B and is bounded. Now set b, = inf v,. For any ve B 
veB 








IA 


len] + 2) 
m0 








sin || he | o(bn) S Jen | + 2y 


- wm 
sin 5 | | Rn | (0m) 


IIA 


Un. 


Thus b ¢ B and is minimal. 


IX. THE VECTOR EQUATION 


In this final section, we consider a vector form of the integral equa- 
tion (1). Let k(-) be an N X N matrix of real functions of LZ; , and for 
each t, let y(- ,t) be a real N-vector valued function of a real N-vector. 
Let y(-) be a real N-vector valued function of time t. With these re- 
interpretations of the notations in mind, we can leave (1) unchanged. 

With M a complex matrix, we let M’,17, and M* denote the trans- 
pose, the complex-conjugate, and the complex-conjugate-transpose, 
respectively, of M. The positive square-root of the largest eigenvalue 
of M*M is denoted by A{M}. 
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If v is a real or complex N-vector, its norm is defined as the ‘‘Euclid- 
ean” norm 


ol = (3 levt?) = (ote 


It is well-known that 


A’{M} = sup o°M*Mo (11) 


lol]=1 


and hence that || Mv || S A{M} || v || for complex N-vectors v. 
As previously, L2(—T7,7') is the space of real-valued, measurable, 
functions x(-) of the real variable ¢ which satisfy 


(t) x(t + 2T) = 2(Z), 
Ge = Eo dt < «. 


We take as our basic space the Nth power of L.(—T,T), i.e., 
L,*( ~~ EY) ’ 


and think of it as composed of column N-vector valued functions of 
time. A norm for L,”(—7',T) can be defined by the formula 


a | = on [wnat 


. mele D, | v(t) Pat 


where « = (41, °:- , 2y)' ¢€ Lo*(—T,T). This norm makes L;"(—T,7) 
a Banach space. Further, an element x(- ) of Lo”(— 7,7) has the Fourier 
representation 


a(t) = lim. >> tne? 


nen m>=—7N 


where the N-vector x, of mth Fourier coefficients is given by 
7. [ —rimt/T 
im = oF & a(te dt, 


and the Parseval identity 


io) ao 


trim = Le lem | = lef 


m=— m>=—o 


for x ¢ L,”(—T,T) holds. 


2870 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 
The matrix convolution operator K is defined on L.*(—T,T) by 
Ka(t) = | k(t — u)a(u)du 


and the operator y by 
yu(t) = y(a(t),t), all t. 
Equation (1) assumes the concise form 
x= y+ Kye. 
The matrix K,,,m = 0, +1, --- is defined by the condition 


Ko Gee = fi e mel? b(t) dt. 


— oO 


It is assumed that >> A{K»} < o. This condition is met, e.g., if 


2d, | kas” P< 2 


™ 


for 1 S i,j S N. The matrix convolution operator K takes a function 
a(:) ¢ L:*(—T,T) with (vector) Fourier coefficients x, into the func- 
tion z(- ) whose coefficients are 


2 ay m=0,+1,:---, 


and the Riesz-Fischer theorem guarantees that 2(-) ¢ L.”(—T,T). 
Further, by formula (11) we have 


I| 2m |] S ALK} || em |. 
An analog of Hilbert’s cube in L.*(— 7,7’) is described by 
Lemma 2: Let {en, —~ <n < o} be nonnegative real numbers with 
dX Cee 
Then the set 
{ee Le”(—T,T): || an || S en, all n} 


as compact. 

This result is a consequence of a known theorem. (See p. 136 of Ref. 
5:) 

Analogs of the growth condition (2) and of the uniform continuity 
condition (3) will be used. These are 
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(7) [| but |] S AC |] wl), for all ¢ and all real N-vectors u where 
A(-) 1s a monotone function. 

(ii) W(-,-) is continuous in the first variable uniformly in both 
variables; its modulus of continuity w(-), defined by 


w(5) = sup || ¥(wt) — vest) |] for Ju — ol] Ss 6 
is a continuous monotone function that approaches zero with 6. 
Theorem 6: If \(-) and w(-) are concave, y belongs to L.“(—T,T), 
= DAK pn} < ©, 
and tf the scalar equation 


r= |lyll + «NX(r) 


has a positive solution r, then there exists an element x ¢ L,‘(—T,T) satis- 
Fying 


x=y+ Kye 
||] Sr 
Il em [| S |] Ym |] + AL Kn} NAC), 


with Xm, Ym the respective mth (vector) Fourier coefficients of x,y. 
The proof of Theorem 6 is an exact analog of that of Theorem 2, 
using the compact set 


{we Le"(—T,T): || tm |] S |] ym [| + ALKnja(r), all m} 
and with w(t) = w(2(t),t), the inequality, (analogous to (6),) 
[| wm || S NAC « 1), 
provable by observing first that for all ¢ 


IIA 


lA 


ay | w(t) | < N? || w(t) | 


< NX] 2(2) ID), 
so that trivially 
| wi(t) | S M*AC| e(e) II) 
and by concavity of \(-), 


sp | lela < WN 2M). 
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Squaring both sides and summing overj = 1, --- , N we obtain 


| wm IP = N°*(| # |}). 
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Equivalence Relations among Spherical 
Mirror Optical Resonators 


By J. P. GORDON and H. KOGELNIK 
(Manuscript received May 11, 1964) 


The frequencies, field patterns, and losses of the resonant modes of spheri- 
cal mirror optical resonators can be obtained to good accuracy as the solu- 
tions of the integral equations of Fresnel diffraction theory. By a simple 
transformation of the variables and parameters of the integral equations, 
we have found certain families of resonators which have the same diffrac- 
tion loss at each mirror, and whose field patterns are scaled versions of 
each other. In the case of the infinite strip resonator, this reduces from five 
to three the number of parameters necessary to specify the losses and mode 
patterns. 


J. INTRODUCTION 


The resonant frequencies, field patterns, and losses of the modes of 
spherical mirror optical resonators can be obtained to good accuracy as 
the solutions of the integral equations of Fresnel diffraction theory.! 
The equations are particularly applicable when the separation between 
the two mirrors forming the resonator is large compared with the di- 
mensions of the mirrors. Unfortunately, the equations are usually not 
soluble analytically, and require numerical (machine) computation. 
There are many parameters involved: the dimensions and curvatures 
of the mirrors and their separation. By a simple transformation of the 
variables and parameters of the integral equations, we have found 
certain families of resonators which have the same diffraction loss at 
each mirror, and whose field patterns are scaled versions of each other. 
In the case of the infinite strip resonator, this reduces from five to three 
the number of parameters necessary to specify the losses and mode 
patterns. 


II. THE TRANSFORMATION 


The equations which determine the field patterns, resonant fre- 
quencies, and losses of an infinite strip resonator (see Fig. 1) are! 


2873 


2874 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 





Fig. 1 — Spherical mirror resonator with mirrors of curvature radii Ri and 
R, , and of widths 2a; and 2a2. Mirror spacing is d. 


_ vif" 
iui (21) = aad [. K (ay ’ te) U2 (a2) da, (1a) 


= ay 
xt be K (ae, 21) (21) day (1b) 


with the complex symmetric kernel 


YoU2 (22) = 


K(a1,%2) = K(a.,%1) = exp [—j(a/dd) (gaat + goxe — Qay2)]. (1c) 


Here 
gi = 1 —(d/R:;), t= 1, 2, 


the mirror separation is d, R, and R» are the radii of mirror curvature, 
2a1, 2a, are the corresponding mirrors widths, and X is the wavelength 
in the resonator medium. Also, w(21) is the (generally complex) nor- 
malized field distribution on the left-hand mirror of Fig. 1, while ue(2x2) 
is the normalized field distribution on the right-hand mirror. If the two 
functions are normalized so that 


a1 ag 
i | wi (a1) |? day = | | we(ae) |? dae (2) 
Ei. Pie 


then one notes” that the power reflection coefficient of the left mirror 


* According to (1b) a light beam with a field distribution wu: (x1) across the left 
mirror causes a field y2u2(x2) across the right mirror, Therefore, the power re- 
flected from this latter (perfectly reflecting) mirror is proportional to 
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is | v1 ? and the reflection coefficient of the right mirror is | y2 |’. There- 
fore the loss at the left mirror is 1 — | y: |’, the loss at the right mirror 
is 1 — ||’, and the round-trip loss is 1 — | y1y2 |. The condition for 
resonance is that yry2 exp [—727(d/d)] be real and positive. 

We presume on the weight of much experimental and _ theoreti- 
cal’ evidence that a sequence of solutions to (1) does exist. Suppose 
now that we have found a mode of some resonator; i.e., we have found 
a solution for u:(x,) and ue2(xz2) which satisfies (1) for one set of values 
of the five resonator parameters a; , d2 , g1 , gz and d, and have found the 
corresponding eigenvalues yi and y2.. Our present concern is to find a 
family of resonators, each of which will have a similar mode; that is, 
a mode with the same values of y; and y2 and with similar (scaled) 
eigenfunctions. For this purpose we rewrite (1) in terms of dimension- 
less variables and eigenfunctions by substituting 


“= aki, = 1, 2 (3) 
and 
vi(Es) = Ui(ay)-Vay , 7 = 1,2. (4) 


By this transformation we obtain a generalized set of integral equations 
for the modes of the resonator 


y(t) = VjiN [ dé: v2) K(é1, &2) (5a) 
+1 
yavaléa) = VIN [ds (&) KE, &) (5b) 
with the kernel 

K(&, &) = exp [—jaN (—26& + Gif’ + Gofr’)]. (Se) 

In (5) only three independent resonator parameters occur 
N = aa,/dd, (6a) 
Gi = gi(ai/az), (6b) 


| v2 [? | | tt2(x2) |? dae . 


—ao 


The power of the beam as it left the left mirror was, of course, proportional to 


a 
[ | eta(aey) |? dar. 


G1 
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Ge = g2(d2/a1). (6c) 


N is the Fresnel number of the resonator, while G and G2 are generalized 
g factors which describe the mirror curvatures.* A geometrical interpre- 
tation of the G’s is shown in Fig. 2. 

Note that the above transformation maintains the normalization of 
the eigenfunctions 


+1 +1 : 
[de |oe) P= [dee | Ce) | (7) 
and therefore the physical meaning of the eigenvalues y; and y2 . 


Ill. DISCUSSION 


The integral equations of any spherical mirror resonator can be 
transformed into the form of (5a, b, c), which describe completely the 








gay 
G, =a! 
Fa, 


Fig. 2 — Geometrical interpretation of Gi. 


mode patterns, diffraction losses and resonant frequencies. It is clear 
that two resonators have the same scaled eigenfunctions, the same 
diffraction losses at each mirror and corresponding resonant frequencies 
whenever they are described by the same characteristic parameters 
N, Gi, and G, . Two resonators are therefore equivalent if 


02/dd => Giyd2/dd = N (8a) 
gi(ti/d2) = Gi(di/d2) = Gi (8b) 
92(Q2/d1) = G2(@2/d,) = Ge (8c) 


where the overbar indicates the dimensions of a resonator equivalent to 
the original resonator. 
* Note added in proof: in recently published perturbation analyses of optical 


resonators, Gloge! and Streifer and Gamo5 have arrived at the same three resona- 
tor parameters. 
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Quantities which can be expressed in terms of N, G,, and G, also 
remain invariant when we change from one resonator to an equivalent 
one. Some of these are listed in Table I. They are used in later computa- 
tions. 

The set of characteristic parameters N, G,, and G, provides some 
insight into the behavior of resonators. The quantity 


¢ = GiGe = 9ige (9) 


may be called the “stability number.’ One finds from its value whether 
the resonator is intrinsically of the “stable” or “unstable” type.’® To 
be stable the resonator must satisfy 


0<G4@ <1. (10) 


TaBLE I—Some INVARIANTS OF EQUIVALENT RESONATORS 








Resonator Parameters Field Parameters 
(See Section III and Fig. 1) (See Section X and Fig. 3) 
(1.1) N = a1Q2/dd (2.1) aye/dz 
(1.2) Gi = gi(ai/az) (2.2) giz(ai/Z) 
(1.3) G2 = g2(a2/a1) (2.3) gz(z/a1) 
(1.4) G@ = gige = GiG, (2.4) GzJiz 
(1.5) di°gi/a2’g2 = Gi/Ge (2.5) a1"gie/2"gz 
(1.6)* gi(a1?2/Ad) — GN (2.6) giz(a1?/dz) 
(1.7)* g2(Ad/ax*) = G.N (2.7) gz(hz/a1?) 
(2.8) gz (a?/dz) 


* Quantities like 1.6-1.7 but with subscripts 1 and 2 interchanged are also in- 
variants. 


The quantity N is the well-known “Fresnel number.” For N > 1 the 
diffraction loss of stable resonators is typically very small indeed, and 
the increase of loss in crossing the boundary from a stable to an unstable 
type is abrupt. As N decreases toward unity, the loss of the stable 
resonators increases, and the boundary becomes less sharp until, as 
N <1, all resonators have high loss. 

Finally, at least for stable resonators with not too small Fresnel 
numbers, we can see that the mirror with the larger G has the smaller 
diffraction loss. From Ref. 6, or from Section VIII of this paper, we 
know that the radii* w; of the fundamental mode “spots” on the mirrors 
are related by 


wi /we = 92/91 . (11) 


* We use the word ‘“‘radius”’ here and later to mean half the width of the mode 
pattern, as defined in Section VIII. 
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We therefore have 
(wi/ar)/(ws/a2) = gra2/gia’ = Go/Gs . (12) 


The ratio w;/a; between spot radius and mirror half-width can be taken 
as a measure for the diffraction loss at a mirror. According to (12) 
the mirror with the smaller G; has the larger ratio w;/a;, and thus the 
larger loss. In the special case Gi = G., one sees that (5a) and (5b) 
become identical, and so the diffraction losses must be equal. 


IV. SPECIAL ADDITIONAL EQUIVALENCES 


There are two previously known!:* special equivalences which exist 
in addition to the new ones we have been discussing. These are: 

(a) reversal of sign of both g; and go 

(b) interchange of both g; and g:. , and a; and a, ; i.c., interchange of 

the mirrors. 
The first of these special equivalences changes the sign of both G, and 
G, and does not alter N. This equivalence results because the allowed 
field patterns split up into those of odd and even symmetry in the 2’s.’ 
The equivalent field patterns are complex conjugates of the old ones, 
but the losses are unchanged. 

The second special equivalence corresponds to an interchange of the 
two mirrors. It leaves G? and N unchanged, but interchanges G and 
G,. It also obviously interchanges the mode patterns and the losses 
of the two mirrors. Combined with the equivalence relations which we 
have discussed before, this interchanging of the two mirrors means that 
two resonators are also equivalent if 


N=WN (13a) 

Gi = Ge (13b) 
and 

Go = G. (13) 


From these relations one deduces some rather curious equivalent resona- 
tor pairs if one postulates that the mirror curvature should be left 
unchanged (g; = g1 and g2 = ge) and only the apertures a; and a, varied 
to form an equivalent resonator. With (13) one finds that 


= ao(gi/g2)? (14a) 
an(g2/91)° (14b) 


= 


=) 
iS) 
I 
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is necessary for equivalence. Note that the equivalent resonator was 
found simply by changing the mirror apertures. The mode pattern that 
appears on the left mirror of this new resonator is a scaled version of the 
pattern that appeared on the right mirror of the original resonator, 
and the pattern that was on the left is switched to the right mirror of 
the resonator. 

Similarly, one obtains a pair of resonators equivalent in the above 
sense when the mirror apertures are kept constant and the curvatures 
are changed in accordance with (18). 


V. THE CONFOCAL RESONATOR 


The resonator commonly known as ‘the’ confocal resonator is 
actually a very special confocal* resonator for which Ri = R, = d, 
and hence g; = ge = 0, and G, = G, = 0. All of the equivalence trans- 
formations we have mentioned transform one confocal resonator into 
another. Our relations bring out the known fact that the losses and field 
patterns (apart from scale factors) of the confocal resonator depend 
only on the Fresnel number N and not at all on the ratio of the mirror 
apertures.” 


VI. RESONATORS WITH EITHER Gi OR Ge EQUAL TO ZERO 


When, in a system with mirrors of unequal curvature, the mirror 
spacing is equal to the radius of curvature of one of the mirrors, then 
one of the g’s is zero and we have G; = 0, or G, = 0. Let go = Gz = 0. 
As a transformation to an equivalent resonator leaves G2 invariant, we 
have for the equivalent resonator gj. = 0. In the stability diagram,’ 
which shows the stable and unstable resonator regions versus g; and 
gz , our transformation yields equivalent resonators that are represented 
by points on a straight line (in the general case, one has a branch of a 
hyperbola gig2 = const). 

The parameters of equivalent resonators with G. = 0 are related by 


G, = Gi = gi(a;/a2) = Gi(Gi/Ge). (15) 


This relation allows one to find for each resonator with g. = 0 and 
unequal apertures an equivalent resonator with g. = 0 and equal aper- 
tures, which is discussed in Ref. 1. Resonators of the former type have 

* Any resonator whose mirrors have coincident foci may be termed confocal, 


whether or not the mirrors have equal curvature. As has been noted only ‘‘the’’ 
confocal resonator is a low-loss resonator. 
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been of interest for the selection of transverse modes in optical maser 
oscillators.’ The mode selection properties of equivalent resonators are, 
of course, the same. For a resonator formed by a spherical mirror and 
a small plane mirror at its center of curvature’ (g. = 0) one finds equiva- 
lent resonators of equal mirror apertures which are the closer to the 
confocal resonator the smaller the flat mirror [compare (15)]. As it 
appears that the confocal resonator has the best mode selection prop- 
erties of all spherical mirror resonators, the above behavior would imply 
that reducing the size of the flat mirror will improve the mode selectivity 
of the above system. T. Li’ has indeed found this to be so on the basis 
of computer calculations. 


VII. RESONATORS WITH RECTANGULAR OR CIRCULAR MIRRORS 


The integral equations which determine the modes of resonators with 
rectangular mirrors decompose into two sets of equations identical to 
(1), each set involving a single one of the two transverse Cartesian 
coordinates.!»® Hence all of the above applies immediately to such 
resonators, including resonators with astigmatic mirrors, provided the 
principal directions of the astigmatism are parallel to the edges of the 
mirrors. 

Equivalent families of resonators with circular mirrors can also easily 
be found by a similar method, starting from the appropriate integral 
equations which are indicated in the Appendix. The resulting parameters 
are of the same form as (6), but with the a; now redefined as the radii 
of the mirrors. 


VII. DETERMINATION OF SPOT RADII 


If the apertures of the mirrors are sufficiently large, 1.e., if N > 1, 
and if G’ is not too close to 0 or 1, then the field patterns of the modes 
approach closely to Hermite Gaussian functions and lose their depend- 
ence on the apertures. Then one can define a “‘spot size’’, or spot ra- 
dius,°* where the Gaussian part of the function has dropped to e™ of 
its maximum. In the transformations among equivalent resonators, the 
mode patterns scale in proportion to the apertures; hence two other 
invariants of equivalent resonators are obtained by replacing a, and 
d2 in (6b) and (6c) with the spot radii w; and w.. Now any quantity 
which is an invariant of equivalent resonators must be expressible as a 
function of the basic parameters N, G, and G, . But since the values of 
N and G,/G2 , which depend on the apertures, do not influence the spot 
radii, these two invariants of the equivalence transformations can be 
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functionally dependent only on G’? = G,G,. Hence we obtain the rela- 
tions 


W1W2/dd = t(@’) (16a) 
(w1/w2)?(g/ gz) = 1. (16b) 


Equation (16b) follows, since we know that for mirrors of equal curva- 
ture (and hence with g; = gz), the spot radii are also equal. The function 
f(G’) on the right side of (16a) may be evaluated by comparison with 
a known result for mirrors of equal curvature 


(91 = go = 9; Wi = W2 = Ww). 


Equation (27) of Ref. 8 can be conveniently expressed in our present 
notation as 


w/rd = (1/r)(1 — g°)* (17) 

which, on comparison with (16a), identifies f(G’) as 
HEY = C/G). (18) 
Equations (16a) and (16b) can be rewritten with the help of (18) as 
w/w, = (g2/gr)’ (19a) 
wiv, = (dd/r)(1 = giga). (19b) 


These last equations are identical with (39) and (40) of Ref. 6 and 
together determine the two spot radii. Their derivation here is included 
because of its relative simplicity, and as an example of the use of the 
invariants. 


IX. FACTORS OF THE GENERAL TRANSFORMATION 


Given the parameters (dimensions and curvatures) of one resonator, 
specification of d and g, for an equivalent resonator completely deter- 
mines all parameters of the equivalent resonator, apart from the special 
equivalences discussed in Section IV. The general transformation from 
the original to the equivalent resonator can be factored into a succession 
(product) of two simpler transformations, in the first of which a is 
changed but gi is not, followed by a second for which g; is changed but 
Q, 1s not. 

The first of these simpler transformations effects a rather simple 
squeezing of all resonator dimensions, all transverse dimensions (aper- 
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tures) being multiplied by the same factor e, say, while all longitudinal 
dimensions (radii of curvature, mirror separation) are multiplied by 
é. To see this we note that R, and d must change proportionally to 
leave g, unchanged. R, must change in proportion with these because 
of the invariance of gig2, ie., of G’. Finally, a, and a, must change 
proportionally to leave G, invariant, and they must change as d* to 
leave N invariant. 

The second simpler transformation leaves the aperture a; unchanged. 
Suppose it changes the radius of curvature R; in accordance with the 
relation 


1/Ri = (1/Rx) + (1/f). (20) 


In practice a thin lens of focal length f inserted directly in front of the 
mirror can produce such a transformation. By using the invariants 
NG, N/G:, and N [listed as (1.6), (1.7), and (1.1) of Table J] in suc- 
cession, one can derive the following relations between the parameters 
of the transformed and original resonators 


1/d = (1/d) + (1/f) (21a) 
1/(d — Re) = [1/(d — Re)] + (1/f) (21b) 
G/d = a,/d. (21¢) 


Equations (21a), (21b) and (21c) show respectively that the position, 
center of curvature, and aperture of the original second mirror are 
changed to those of the new one by imaging them through the lens. 
In this imaging process, objects on the side of the lens toward the second 
mirror are taken as virtual objects, while objects on the other side of 
the lens are taken as real objects. 


xX. TRANSFORMATION OF THE FIELD INSIDE AND OUTSIDE THE RESONATOR 


The mode patterns on the mirrors of two equivalent resonators are 
scaled versions of each other, and one expects also a correspondence of 
the fields of a mode inside and outside the equivalent systems. This 
correspondence is studied in this section. 

With the assumptions of the diffraction theory of optical resonators 
the fields inside or outside the resonator structure can be expressed in 
terms of the field pattern on one of the mirrors via Fresnel’s formula. 
For fields independent of y (this restriction can be removed easily; 
compare Appendix) we have for the field traveling to the right, say, 
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u(a,z) = vi ' da, Uy (ay ) 0) 
eZ way, 
(22) 
“exp | -1 7 (gietv + 9.2 — 2a) | 


where 1;(21 , 0) is the given field pattern on the left mirror and u(2, z) 
is the field on a spherical references surface that intersects the optics 
axis a distance z away from the mirror (see Fig. 3). The quantities 


Q1z 1 -— (2/R1) (23a) 
gz = 1 — (2/R) (23b) 


are again used to describe the curvatures of the mirror (curvature 
radius R,), and that of the reference surface (curvature radius FP). 
The mirror width is 2a, . 

The transformation to an equivalent resonator changes the aperture 
and curvature of the mirror under consideration, and scales the field 
pattern on it accordingly, i.e., if 


l 


a, —> ay (24a) 
nh (24b) 
then 
Vay t(a1, 0) = Wa; t1(H , 0) (24c) 
where 
21/0, = %/d. (24d) 
: . 






z me i By 
\ 


REFERENCE | 
SURFACE 


Fig. 3 — Reference surface for description of the fields inside the resonator. 
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We seek a new surface, described by 2, g, (or R), on which a scaled 
“image” u(, Z) of u(x, z) will be found. To do this we find, just as in 
Section II, a set of invariants necessary so that (22) retains its form in 
the transformed parameters. Essentially these invariants are the three 
terms in the exponential of (22), with the added fact that since 1 
transforms like a; we replace 2; in those terms by a;. By manipulation 
we obtain the set of invariants listed as (2.1) through (2.8) in Table I. 
The terms (2.6), (2.8), and (2.1) come directly from the three terms 
in the exponential of (22); the others may be derived from them. 
Finally, the transformed function is given by 


(z/ay)*u(a, 2) = (2/di)*a(4, 2). (25) 


From this set of invariants one can find the new position (2), curvature 
(g.) and transverse scale factor (#/x) of the scaled function. First, 
consider the position. The invariant (2.6) may be expanded, using 


(23a), as 
2 2 
ay 1 _ 1 _ a d a 
nN (: x) ~ hd (! as n) om 


But now the term (a;9g:/Ad) is itself an invariant (1.6, Table 1) of the 
resonator transformation, and hence the remaining part of (26), i.e., 


Gre 

dy Z 

also forms an invariant. Finally we can simplify this a bit by dividing 
by N(1.1, Table I) to yield the invariant 


aCz) a 


We sce that the ratio z/(d — z) transforms like a;/a,. From (27), we 
obtain the equation from which the new position Z may be derived 


ede) a8) 


Once we have found the new position, the new transverse scale factor 
and curvature may be found most easily using the invariants (2.1) 
and (2.4), respectively, of Table I; i.e., 


(@/x) = (2/z2)(a1/d1) (29) 











and 
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Gz = G42! rz (30) 


and the scaled function is as given in (25). 

To provide a more physical picture of the field transformation, it is 
interesting to note that the simple squeezing and imaging transforma- 
tions discussed in Section IX apply to the arbitrary reference surface 
and its field as well as to the second mirror and its field. 

Finally we note that the transformations of Fresnel’s formula we have 
been discussing do not depend on the fact that w(21, 0) is an eigen- 
function of a resonator. The preceding discussion, with the exception of 
the derivation of (26), all applies equally well to the fields generated 
by any prescribed field distribution over an aperture. 
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APPENDIX 


Resonators with Spherical Mirrors of General Shape 


Within the assumptions of the theory of optical resonators'~® the 
modes of a resonator formed by two spherical mirrors of quite general 
shape are governed by the integral equations 


oy (21, 1) = Jf dA2:K (a1, X Yi, Yo) - Ho ( xe , Yo) (31) 
Ag 


and 
O2l2( x2 , Y2) = a) dA,:K (21, x2 Yi, yo) -Ey(a1, Yi) (32) 
Al 


with the kernel 


K(x »%2 541, Yo) 
= exp{ at) slg (a + yp) + go(ae” + ye") — 2(am + wl}, (33) 


Here (a1, y:) and (a2, y2) are coordinates in planes perpendicular to 
the optic axis, d is the mirror separation, and g; and g2 describe the mirror 
curvatures as in Section IT. Subscript “‘1”’ indicates quantities associated 
with the mirror on the left-hand side, and ‘‘2” refers to the mirror on 
the right. o1 and oe are the eigenvalues corresponding to y; and 2 dis- 
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cussed in Section IJ. The integration has to be performed over the re- 
flecting areas A, and A, of the mirrors where dA; and dA, are the area 
elements. No assumptions on the curves bounding the reflecting areas 
have been made, and the formulation (31) and (32) includes mirrors 
of quite general shape. Special cases are, of course, strip mirrors, square 
mirrors, rectangular mirrors, and mirrors of circular shape. These are 
of main practical interest. 

Let us compare (31), (82) and (33) with (la), (1b), and (1c) of 
Section II. It is clear that the discussion of two-dimensional resonators 
systems given in Section II can be extended to the three-dimensional 
case in which we are interested now. The only difference is that we now 
have two transverse coordinates (x, y). If they are subjected to the 
transformation 


C= E10; 5 Yi = €xYi 5 c= 1, 2 (34) 


and the mirror areas and area elements are scaled like 


A; = ec A; ; dA; = e dA; (35) 


then the mirror curvatures and the mirror separation of two equivalent 
resonators are related by the same invariants as before. All we have to 
do is to replace a,’ by A; in the table of invariants. For the special case 
of circular mirrors, a; can be redefined as the mirror radius and retained 
in the invariance relations. 

Note that we have used the same scaling factors e; for the « and y 
coordinates. If different scaling factors are used one obtains, of couse, 
equivalent resonators with mirrors that are not spherical but astigmatic. 
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Modes in a Sequence of Thick Astigmatic 
Lens-Like Focusers 
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M axwell’s equations are solved for a periodic sequence of lens-like focusers 
separated by gaps. Each focuser consists of an arbitrarily thick slab of 
dielectric in which the dielectric constant tapers off radially with different 
quadratic laws in two perpendicular directions. Since there are no limita- 
tions on the thickness of the slabs, the solutions cover the complete gamut 
from a sequence of infinitely thin lenses with astigmatism to a continuous 
dielectric waveguide, and from spherical to cylindrical lenses. 

The field configurations of the modes and their propagation constants, as 
well as the transmission and cutoff bands, are calculated. Any arbitrary 
input field distribution can then be expanded in terms of the normal modes, 
and the expansion determines the field everywhere. 

Formulas derived for sequences of weak lenses turn out to give very good 
results even for lenses whose thickness and separation are equal to the focal 
length. 


I. INTRODUCTION 


One possible long distance transmission.medium for optical waves 
consists of a periodic sequence of converging lenses. In order to negotiate 
unwanted but unavoidable bends of the axis of the sequence it is neces- 
sary to space the lenses as closely as possible.! Nevertheless, ordinary 
dielectric lenses exhibit substantial surface scattering, and therefore 
the minimum spacing between lenses depends on the tolerable transmis- 
sion loss. 

D. W. Berreman has shown that an effective lens can be made using 
gas with thermal gradients,?? thus avoiding the solid-to-gas transition 
problems. D. W. Berreman and 8. Ii. Miller? proposed a gaseous lens 
consisting of a tube with hot walls through which a mild gas current at 
lower temperature is forced to flow. At any cross section the tempera- 
ture increases from the center to the wall. The density and consequently 
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the dielectric constant is then maximum on the axis and decreases 
radially roughly with a square law. Without the problem of scattering 
at the interfaces, tubular gas lenses can be closely spaced and the gaps 
may be comparable to the thickness of the lenses. 

The advent of such a new transmission medium makes it opportune 
and important to generalize the theory of modes in a sequence of thin 
lenses by determining the normal modes in an idealized structure which 
consists of a periodic sequence of arbitrarily thick slabs of dielectric 
whose dielectric constant tapers off radially with quadratic law. 

The preferential direction of gravity creates convection currents that 
may introduce astigmatism in the gaseous lenses. Such an aberration 
is included in our model by making the radial quadratic law of the 
dielectric different in two perpendicular directions. 

We calculate the modes of propagation of the idealized structure 
without including the solid walls surrounding the medium. Taking them 
into account would perturb the modes only slightly, introducing diffrac- 
tion losses. Just as in the case of a waveguide with perfect metallic 
walls, the idealized modes considered here are not attenuated, but their 
discussion is similarly expected to be useful in approximating: (a) the 
propagation constants: (b) the range of dimensions over which trans- 
mission is permitted or forbidden; (c) the extent of mode conversion at 
discontinuities or imperfections; and (d) the field at any point due to an 
arbitrary input such as an off-axis or tilted beam. Of these, (a) and (b) 
are treated in this article. 

The calculations are general enough that by changing the lens param- 
eters and the length of the gaps it is possible to cover uninterruptedly all 
the range from a sequence of thin lenses*:§.75 to a continuous dielectric 
guide,!?!° and from spherical to cylindrical lenses. Up to now only the 
extreme cases, that is, thin lenses or dielectric guide and spherical or 
cylindrical lenses, have been considered in the literature; this article 
bridges the gaps. 


II. CPESCRIPTION OF THE PROBLEM 


Consider a periodic sequence of dielectric slabs, shown in Fig. 1. 
The refractive index v of each slab is independent of z, but varies with 
different quadratic laws in the x and y directions as 


Kol @-@) 


The refractive index n on the z axis and the characteristic parameters 
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Fig. 1 — Periodic sequence of arbitrarily thick and astigmatic lenses. 


of the lens-like medium, [1 and Lz , permit adjustment of the parabolic 
distributions. The physical significance of L; and ZL, will be treated 
below. 

In spite of the fact that it reaches negative values for large x or y, 
this dielectric distribution is useful because it matches the dielectric 
distribution of the gaseous lens, especially for small values of r2/Ly 
and wy/L. . Besides, it turns out that the field of most modes is negligi- 
ble in the region where the dielectric constant is small or negative, and 
consequently that region does not contribute essentially to the guidance 
of the modes. 

In the Appendix we solve approximately Maxwell’s equations. The 
sequence of lens-like focusers supports hybrid modes EH,,, charac- 
terized by the indexes p and q. These integers indicate that the intensity 
of each transverse field component passes through p zeros in the x 
direction and q zeros in the y direction. 

The only approximation in the solution of Maxwell’s equations 
consists in neglecting terms of the order of p\/Z, and g/L, compared 
to unity. \ is the free-space wavelength. Typically \/Z; and /Lz, are 
of the order of 10-°; therefore, except for very high-order modes (p 
and/or q very large), the results must be satisfactorily precise. 

The modes have no electric field in the y direction nor magnetic field 
in the x direction. The remaining components — E,, E,, H, and H, 
in the dielectric slabs and E,, , Ez, , Hy, and H,, in the gaps — are found 
assuming as normal modes only those field configurations that repeat 
themselves periodically at each lens. Therefore the equiphase surfaces 
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of each mode are planes at 2 = 0 and ¢ = 0, as shown in Fig. 1. We 
reproduce here only E, (55) and E,, (58). 

All the other components can be deduced from them with the help 
of (45). 


2 2 2 
eS at ip ee ee a! 1 a1 OL ae 
Ii, = exp 1 Ei (= oR, i) (» + ,) tan (% tan 7) 
1 1 [w” TZ a \? y\ 
— (a +5) taunt (3 ton 2) | - (2) - (2) 2) 
‘Hp (v2 2) H, (v2 ¥); 
p1 p2 
E ty x y 1 ae 
= exp{—i[é(¢ 55 — 5H) — (p+ 5) tan hesgi? 
(ota) 2a] -(Z) - 2] : 
— =)t re ey een 3 
(« x 5) sae k8gx" Pol Pg2 ( ) 


(vag) m(ve3) 


where [see (55) to (67)], k = wr/equ = 27/d is the free-space prop- 
agation constant and H,(a@) is the Hermite polynomial of order u. 

The physical significance of the symbols s;, 82, Sg1, Sg2, Ri, Re, ete. 
will be developed below. We give first their mathematical meaning and 
in order to avoid repetition, from now on the letter m will stand for 
either the subindex 1 or 2, depending on whether the symbol under 
consideration refers to a dimension in the plane y = 0 or x = O respec- 
tively. Calling the thickness of each dielectric slab ¢, and the gap between 
them }, 














a 14+ °C, ctn en) 

ta tn (Fe 
Sam = Wm(L + Cn etn gm)'(L — Cn tan om) (5) 

1 Lim 
nay Vee (6) 

T n 

ax Ob 

Cn = 57. 7) 


(8) 
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+-: cin —— (9) 


om = on f/ } fi : (ay +[1- ey vos 272) (10) 
2 Sm Sm Ln 

Ke Som or yr 

Rom = Ze F 3 (24) | (11) 


~~ Oe \2 
pon = bon f/ 1 + (5) (12) 


Let us find the physical significance of Rm, Rgm Pm Pom Sm Sqm, 
Wm and Lm . Equating in (2) and (38) the imaginary parts of the expo- 
nents to constants we obtain two equations of equiphase surfaces (wave- 
fronts), one applicable within a lens and the other in a gap. At the z 
axis, each wavefront has a radius of curvature in the plane y = 0 which 
in general is different from that in the plane x = 0. Within a lens those 
main radii of curvature are R; and R, [see (9)], while those in a gap are 
Ry» and R,» [see (11)]. If L, = L,, then Ri = Reand Ry = Ry. 

For the fundamental mode p = q = 0, at a given abscissa z or ¢ the 
field amplitudes (2) and (3) decrease with different Gaussian laws in 
the x and y directions. The distances at which the field is 1/e of the 
maximum occurring on the z axis are the beam sizes p; and p; [see (10)] 
within a lens, and p,i and p,2 [see (12)] in a gap. 

For z = 0 and ¢ = 0 we find from (10) and (12) that pn = s» and 
Pam = Sgm. Therefore sm and sym are the beam sizes at the planes of 
symmetry of each lens and each gap respectively. 

The physical significance of w» becomes obvious on reducing the gaps 
between lenses to zero. Then instead of a sequence of lenses we have an 
uninterrupted dielectric waveguide and we derive from (7), (4), (5) 
(10) and (12) that 





Pm = Pgm = Sm = Sgn = Wn. (13) 

Therefore in the continuous guide the propagating normal modes do 

not change size along z, and for the fundamental mode w; and wz measure 
the beam sizes in the x and y directions. 

From (10) we find that within a lens the beam sizes pi and p, in the 

y = O and x = 0 planes vary periodically along z; their periods are LZ; 
and L» respectively. 

For the particular case in which L, = Ly the field in the gap (3) 
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coincides with that found by Boyd and Gordon’ for the resonator made 
with confocal mirrors of infinite aperture. 


III. TRANSMISSION AND CUTOFF CONDITIONS 


Both s,, [see (4)] and sym [see (5)] must be real quantities, otherwise 
the fields given in (2) and (3) become infinite as x or y > ©. This 
establishes that a mode can propagate in the sequence of lenses either 
when 


Cm S ctn om (14) 
or when 
Cm S —tangn. (15) 
Their equivalents in explicit form are 
b S (2Ln/nr) etn (rt/2Dm) (16) 
and 
b S —(2L2n/nr) tan (at/2Dm). (17) 


Which equation must we use? Since b and L», are positive, (14) or (16) 
must be used when gm = 7t/2Lm falls in an odd quadrant and (15) or 
(17) when it falls in an even quadrant. Naturally, if these equations are 
satisfied for only one of the two indexes, that sequence of lenses cannot 
propagate any nonattenuating mode. 

If b = 0, the sequence of lenses is reduced to a continuous waveguide 
and transmission takes place, as it must, no matter what the values 
of yg; and g, are. If now we increase the gap b, transmission will take 
place as long as (16) or (17) is satisfied. 


IV. DISCUSSION OF THE FIELD INSIDE AND OUTSIDE THE LENSES 


The sequence of lenses admits a complete set of modes. For each mode, 
the field inside (2) and outside (8) the lenses is a wave traveling in the 
z direction whose amplitude, period and equiphase surfaces (wave- 
fronts) vary along z. 

The amplitude depends on « as a product of a Gaussian function and 
a Hermite polynomial (parabolic cylinder function) whose degree 
depends on the mode under consideration. A similar type of variation 
occurs along y. 

In Fig. 2 we plot qualitatively the beam sizes p, and pgm for gm = 
at/2Lm in the first, second and third quadrants. For ¢,, in an odd quad- 
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rant, as in Figs. 2(a) and 2(c), the maximum and minimum beam sizes 
within each lens are 


eh 1+ C,, ctn =) 
Pmmax = Sm = Wn {+O sien (18) 
and 
2 2 
es 5 1 — C,, tan ¢n\* 
Pmmin = ae mi tn {7 Gn tan en) . (19) 


The period between two successive maxima is L,, . The square root of 
the product of the maximum and minimum beam sizes in the dielectric 
is a constant 

(pm max Pm ey = Wn 


and coincides with the beam size w» of the lens-like medium. 
In the gap, the only extremum for the beam size is a single minimum 
which occurs at ¢ = 0 and, from (5) and (12), corresponds to 


Pgm min = Sgm = Wm(1 + Cn ctn Om) (1 2 Ce tan om)*. (20) 


If Om = mt/2Lm falls in an even quadrant, as in Fig. 2(b), the mini- 
mum and maximum beam sizes interchanged from the odd quadrant 
are (18) and (19) respectively. Again (20) corresponds to the unique 
minimum in each gap. 


V. SPECIAL CASES 
Let us consider the field in a gap assuming 
i=l, = L 
and 
(/L=n or g = @ = n(x/2) (21) 


where 7 is an integer. Then unless the gap b = 0, the minimum beam 
size in the gap pgm min (20) becomes infinitely large and the electric 
field (3) is reduced to a plane wave travelling in the z direction. If more 
generally only g; = (2/2), but g is unrestricted, then the wave fronts 
are cylindrical surfaces parallel to the x axis. 

Consider again 


Ly => Ly = L 
but 





| 
| 


Q & < 

| _ &_ 
| | - 
Lag ¢-—»| = 


| 
ecu 


L_ 


je 


oS 
uu 


Vln an 


a Lh 
| 
! 
| 
a 

&<— at §<— 

2 | 
| 
! 
| 


zy & 


lo 
TS 
| 


oN 


2 
we 


~~. 

~ 

Pig 
fo} 


~~ 


eam size 


2; (c) 


ie} 

5 

| Eo 
* ae 


size for gm = = Z,, in second qua rant, 1 
m 





MODES IN LENS-LIKE FOCUSERS 2895 





Cy = C, = cthng, = ctng (22) 
or 

Cy = C, = —tang,; = —tang. (23) 
Then according to (20) the minimum beam size pg1 min = Pg2 min = O 
and the field in the gap (3), for p = gq = 0, becomes 

Bq = exp —ike (1 se v _ u). (24) 


The wavefronts close to the ¢ axis are concentric spheres and their 
centers coincide with the point x = y = ¢ = 0. 

Therefore the two conditions indicated above correspond either to 
plane waves in the gap or to concentric waves (if one observes only the 
field in the region close to the ¢ axis). They are equivalent to those in 
Fabry-Perot resonators with plane and concentric mirrors.”°"’ 

The condition under which the beam is closely concentrated on the 
z axis is found by minimizing the maximum beam size within a lens, 
Sm [see (18)] or Wn’ /Sm [see (19)] depending on whether yg,» is in an odd 
or even quadrant. 

If the gap b decreases, the value of sm OF Wm /Sm also decreases; for 
b = 0 the sequence of lenses becomes a dielectric waveguide, the beam 
size does not vary with z, and its value is sm = Wm /8m = Wm. On the 
other hand, if the thickness ¢ of each lens is the only variable, the mini- 
mum Of Sm OF Wm /Sm is achieved by making 





OS a OS 0 if gy» Is an odd quadrant 
Ol OGm 
or (25) 
01 0 oi 


——=—-—=90 if y,, is in an even quadrant. 


These conditions lead to the same requirement, namely: 
Cm = ctn 2on (26) 
or its equivalent 
b = (2L,,/nr) ctn (rt/Lm) (27) 


which, replaced in (18) or (19), determines the minimized value of the 
maximum beam size within each lens 


Sm min = (Wn /Sm) min = Wml(1 ++ Ci) “tf Cul. (28) 
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For the same condition (26) or (27), the beam size in the gap at any 
abscissa ¢ is derived from (5), (12) and (26) 


wc sor tren] 


VI. SEQUENCE OF WEAK ASTIGMATIC LENSES 


Before considering weak lenses, let us relate the characteristic lengths, 
I, and Ly of the lens-like focusers to their focal lengths in the planes 

= QO and x = 0. To calculate the focal length in the y = 0 plane (see 
Fig. 3) the ray trajectory is determined from the equation 


dx/dz = (1/v) (dv/dx). (30) 


Taking the refractive index v from (1) 


dx _ 1 d 2 
“a =e ey ee os 


For paraxial rays 


and within a lens the trajectory of a ray entering parallel to the z axis 
at a distance 2% is 


x = 2 cos (w2/Ih). 
The angle of refraction at the output surface is 
= (nr/Iy)x sin (at/Iy). (33) 
Peane 


t peaareas 
ij eS = | 
i | 

\ 


__pS 


Fig. 3 — Ray trajectory in the plane y = 0 of a lens. 
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Then from simple geometric considerations in Fig. 3, the focal length 
fi results 
I, 


an sin (at/Ly) © (34) 


i 
We assume now weak lenses. They are characterized by 


Pn = (a/2) (t/Lm) <K 1, (35) 


and in all previous results each circular function can be replaced by its 
leading term. 

Because of the inequality (35) the characteristic length of the focusing 
medium JL, in (384) can be calculated explicitly by 


Ty = TV ntfy . (36) 
Similarly, for the plane x = 0 
Ty = TY nto. (37) 


The weak lens requirement (35) then becomes 


a ie 
Using (36) and (37) together with the simplifying assumption (38) we 
re-evaluate the maximum and minimum beam sizes (18), (20) for 
weak lenses (¢» in first quadrant), 





YO - — w 


The distance h between the principal planes may be of interest. Using 
(33) and (34), this distance turns out to be 


2Lim at 
Expanding the circular function in series, keeping only the first two 
terms and substituting L,, by their equivalents (36) and (37) we ob- 
tain, 


h =:(4-1)+ 55 (42) 
ve n 12n?fin 
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6.1 Example 
Let us assume a sequence of gaseous lenses such that 
b=t=fi= fe = 025m 
= 0.6328 10° m 
nw. 


For these dimensions ¢; = g = 0.5, and therefore the weak lens in- 
equality (38) is hardly satisfied. Nevertheless, let us go ahead and 
calculate extreme beam sizes s; = Ss. and 8,1; = 8,2 as well as the charac- 
teristic length L, = Le of the lens using (39), (40) and (36) 


8, = 8. = 0.286 mm 
Sql = Sy = 0.248 mm (43) 
Ly = L, = 0.785 m. 


Let us calculate again the extreme beam sizes using the exact expres- 
sions (18), (20), deriving Z from (34) 


8, = S& = 0.276 mm 
Sai = Sgo = 0.224 mm (44) 
Ly, = Ly, = 0.704 m. 


Il 


The two sets of results (43) and (44) are reasonably similar and show 
the usefulness of weak lens formulas even for lenses with comparable 
values of t and f. 


VII. CONCLUSIONS 


The properties of the modes in a sequence of thick, astigmatic and 
unbounded lens-like focusers are similar to those in a sequence of thin 
infinitely large lenses. 

The modes are hybrid and described by parabolic cylinder functions 
(product of Gaussians times Hermite polynomials). Transmission takes 
place as long as the gap between lenses is smaller than a value given in 
(16) or (17). 

The maximum beam size can be reduced by decreasing the distance 
between dielectric slabs. Nevertheless, if the gap is fixed, the minimiza- 
tion of the maximum beam size can be obtained by selecting the dielec- 
tric properties or the thickness of each focuser according to (27). 

Simplified formulas derived for sequences of weak lenses yield good 
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approximations even for lenses whose thickness, separation and focal 
length are comparable. 
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APPENDIX 


Solution of Maxwell’s Equations in a Sequence of Thick Astigmatic Lenses 


We will obtain, first, a general enough solution of Maxwell’s equations 
for one of the lenses; see Fig. 1. Then by making n = 1 and LZ; = L, > «, 
we will deduce a general solution for the uniform gap between lenses, 
and finally we will match the tangential fields to satisfy the boundary 
conditions. For modes with only four field components, E, , E,, H, and 
H,, Maxwell’s equations become 


On, dB. 














Oz an jou 
Ok, = —jwpH, 
dy 
(45) 
a) ere 
ay ae ea 
oy = —jweH, 


where yu, the magnetic permeability, is a constant; 


<[-G-@] 


e, Ly and Ly are arbitrary constants; and w\/eu = 27/A = k is the 
free-space propagation constant. 

By eliminating variables and by neglecting terms of the order of 
\/L, and \/L,* as compared to unity we obtain identical equations for 
E, and H,. For E, , 


OL, OE, OE, 2 | (e) (7) | = 


* In practice A/L and A/Le are of the order of 1075. 
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This equation is separable and a general solution is 
2 2 
nol -()-(0)] 
exp | (2 u 
—_ w+ti1nr eee 
, = eee ee 4 
x Ds Dy Ay exp | iknz (1 - a Sa ») (48) 
W1 We 


where v and yp are integers, and A,, is an arbitrary constant. Using m 
to indicate either subscript 1 or 2, 


try = 2 4/M, (49) 
vie n 


H(t) = (—1)’e” (d’/d?’)e* 


}? 





The function 


is the Hermite polynomial™ of order v. Hermite polynomials of lowest 
degree are Ho(t) = 1; Hi(€) = 2&; H.(£) = 4& — 2; and H,(£) = 
SE — 12¢, 
Expression (48) can be simplified provided that the important terms 
of the summation are those for which 
vr/ Ty <K 1 
and (50) 
bA/ Ly <K 1. 


Then the square root in the exponent can be replaced by the first two 
terms of a power series satis and 


By 2 exp (~ia — # ~ 4) ¥ 4, exp tiGeve/t tt. (Y*) ] 
x | DBs exp linve/Ta) Ht, (2) | 


A fl 1 


We will look for a periodic field configuration that reproduces itself 
at each lens. For reasons of symmetry, then, the planes of symmetry of 
the lenses (2 = 0) and gaps (¢ = 0) must be equiphase surfaces. 





(51) 


where 
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We choose the field at the plane z = 0 to be 


ss rae dere Lue, 


where p and q are integers and s, and s, are arbitrary parameters for 
the time being. Therefore for z = 0, we obtain from (51) 


Sn mo(- £8) (B) = m0 (-B)m (2B). ow 


Using the orthogonality properties of the Gaussian-Hermitian product,” 
we obtain 


Fa lol-#G+ a) 
Ay a —F(— + — 
VJ 12"v! wy ‘2 oe é sv T wr 
Hy, (8) 1 (3) de 
Wi 
and a similar expression for B,. Replacing the result in (51) and per- 
forming, as in Ref. 13, first the summation in v and uw and then the 


integration in &, the transverse field component inside a lens expressed 
in closed form results 


x y 1 
Ez = exp {1 in (< — Spe ts) = (» oe 3) 
2 
—1 { W1 wey 1 -1 We 
-tan (4 tan =) (« “+ 5) tan (2 tan =) | (55) 
2 2 
Pl p2 pL p2 


where, for m = 1 or 2 








and 





+ etn — (56) 


and 
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= 1 Wn ' Wm : : 2nz\ 57 
Pm Sm 3 i + (**) + E — (2=) | cos Gal 7 (5 ) 


The electric field in the uniform dielectric gap between two lenses 
can be derived from the previous expression by making 





n=1 
and 
1/Lm = 0 


and by substituting another symbol, sym, for Sm. Again we demand the 
plane of symmetry of the gap, 2 = (b + t)/2 (see Fig. 1), to be an 
equiphase surface. This is achieved by substituting ¢ = z — (b + t)/2 
for z. 

The electric field in the gap is then 


; x y° 1 ee @ 

Ez, = exp {-s «(+ —_ OR, _ z) — (» + ) tan iis,2 
or B]-@)-() 

_ ~|]t —{—) -—[(— 58 

(« uz s) = K8qo" Pg Pg2 ( ) 


wu(veg)ue(vag) 


Pg2 


sere (se) : 
Rom = 4¢ 1 Som? (59) 


/ 2 
Pgm = Sgm 1 + Zs) ° (60) 


To match the fields (55) and (58) at the interfaces, the x and y 
dependences of the field at both sides must coincide. The fact that it 
can be matched guarantees that Maxwell’s equations are satisfied 
simultaneously in lenses and gaps. It can be verified that if the tangential 
electric field continuity is satisfied, the tangential magnetic field con- 
tinuity is also guaranteed. By considering waves propagating in both 
directions, it could be possible to take into account reflections at the 
interfaces, but we shall instead assume that at each interface there is a 
matching mechanism that prevents reflections. Notice that in the case of 








where 





and 
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gaseous lenses the small changes of dielectric constants automatically 
insure negligible reflection at the interfaces. 

The exact matching of the fields at the interfaces is achieved by 
making equal the coefficients of x, y, 2” and y’ in both expressions (55) 
and (58) at the boundary z = ¢/2 of the lens and ¢ = —b/2 of the gap. 
Then 


Rime = 12) = Rome = 0/2) (61) 


Pm(z = t/2) = Pgm[t =—(/2)] - (62) 


From them, together with (56), (57), (59) and (60), we deduce the 
values of s, and s,, that guarantee the matching at the interfaces. 
They are: 


and 
Spm = Wn(l + Cm etn gm)? (1 — Cm tan gm)* (64) 
where 
Cm = (1/2) (b/Lm) (65) 
gm = (2/2) (t/Ln) (66) 
Wm = (1/7) YrALn/n- (67) 
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Substitution of Laminated Low-Carbon 
Steel for Silicon Steel in the Cores 


of Wire Spring Relays 


By WILLIAM C. SLAUSON 
(Manuscript received May 13, 1964) 


This article describes the analytical and laboratory studies undertaken to 
determine tf low-carbon steel could be substituted for the more expensive 
1 per cent silicon steel an the cores of general-purpose wire spring relays. Not 
only ts this silicon steel more costly, but its hardness characteristics are such 
that tool maintenance for manufacture 1s an appreciable item. It was found 
that this substitution can be made without degrading the performance of 
these relays, provided the new core 1s made up of two laminations. When 
two laminations are used, the eddy current time constant of low-carbon 
steel matches that of the silicon steel. This 1s necessary to achieve the fast 
operate and release times now obtained and to permit satisfactory operation 
of present circuils when the substitution 1s made. 

This substitution will result in substantial annual manufacturing savings 
for the general-purpose relays. These savings could be further increased if 
use of the new core could be extended to other, more special, relays of the 
wire spring relay family. These applications are now under study. 


I, INTRODUCTION 


The wire spring family of relays (see Fig. 1) was designed to serve as 
the basic components of modern switching circuits. It was first intro- 
duced in the No. 5 crossbar switching system and later in other systems, 
including a wide variety of switching applications for the Bell System. 
The design provides an electromagnetic device of high efficiency and re- 
liability with excellent operating characteristics and suited to a high 
degree of automation in manufacture. The relays are obtainable in a wide 
variety of codes with different coil resistances and are capable of con- 
trolling from 1 to 24 contact sets per relay in various combinations of 
makes, breaks, transfers, and operating and releasing time intervals, 
ranging from a few milliseconds to longer than one-half second in slow- 


2905 


2906 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


SPRING CLAMP 





Fig. 1 — Wire spring relay. 


release applications. The operating life of these relays approaches ap- 
proximately one billion operations. In view of these considerations and 
the outstanding performance record of the relays now in use, the demand 
has been continually rising over the past years. Since the production of 
general-purpose wire spring relays began in 1950, more than one hundred 
million have been manufactured, and approximately twenty million of 
these were produced in 1963. 

One per cent silicon steel was chosen originally as the magnetic core 
material for these relays because of its high resistivity, good magnetic 
properties, low aging characteristics, and its ability to achieve fast, 
efficient and stable operating characteristics. This material has proven 
satisfactory but over the years has presented some manufacturing and 
procurement problems. Therefore, consideration has been given to the 
use of alternate materials, particularly low-carbon steel for the relay 
cores. This article will discuss the theoretical and the manufacturing 
aspects involved in substituting laminated low-carbon steel for the 1 
per cent silicon steel cores. Only the general run of fast-operate and 
fast-release relays will be considered, since the major effect of the ma- 
terial substitution is due to the difference in the eddy current time con- 
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stants (because of the low resistivity of low-carbon steel). The slow- 
release relays will not be discussed, since they have built-in eddy current 
inducers, such as short-circuited windings or copper sleeves, which re- 
duce the eddy current conductance of the magnet core to secondary 
importance. 


II. PRESENT CORE DESIGN 


When the general-purpose wire spring relay was developed in the late 
1940’s, the best material available for magnetic cores was 1 per cent sili- 
con steel. As a result, the magnetic design of the relay was based on the 
use of this material. One per cent silicon steel has relatively high electrical 
resistivity, which keeps the eddy current time constant of the structure 
to a minimum, thus permitting fast operate and release times. The ma- 
terial has good magnetic properties with relatively high flux densities, 
which permit the development of ample pull forces between the relay 
armature and core. Also, it is a stable magnetic material that does not 
exhibit any significant change in properties with time. As a result, gen- 
eral-purpose wire spring relays with silicon steel cores have delivered re- 
liable performance with good operating margins for the past fourteen 
years. 

The disadvantages of silicon steel have been in the manufacturing and 
procurement areas. This steel is relatively difficult to fabricate by punch- 
ing, because it has an abrasive action on punches and dies which necessi- 
tates frequent tool maintenance. Also, it is not as readily available and 
is more expensive than the low-carbon steels, such as S.A.E. 1010 steel. 
As a result of these considerations, there has been a continuing effort to 
find or adapt a substitute magnetic material for cores of general-purpose 
wire spring relays. This has led to the development described in the sub- 
sequent sections of this article. 


III, LAMINATED CORE PROPOSAL 


Recently a new and relatively inexpensive method of annealing low- 
carbon steel to obtain good and stable magnetic characteristics has been 
developed using “wet forming” gas. This annealing technique produces 
in low-carbon 8.A.E. 1010 steel magnetic properties comparable to those 
obtained with 1 per cent silicon steel. Thus, low-carbon steel, which is 
much less expensive, could be considered as a substitute for silicon steel 
in the cores of general-purpose wire spring relays. However, 8.A.E. 1010 
steel has a nominal electrical resistivity of 12 microhm cm™!, whereas 
the comparable value for 1 per cent silicon steel is 25 microhm em7, 
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The eddy current conductance of a material is inversely proportional to 
the electrical resistivity; hence, the eddy current conductance of 8.A.E. 
1010 steel is about twice as great as that for 1 per cent silicon steel. Since 
the operate and release times of a relay are directly affected by the eddy 
current conductance, a direct substitution of 8.A.E. 1010 steel for 1 per 
cent silicon steel would materially increase these times in general-purpose 
wire spring relays. Such a change in performance would be intolerable, 
since many switching circuits are designed to take full advantage of the 
fast operate and release times obtainable with the present relays. An 
increase in these times would result in circuit-race conditions or add to 
circuit holding times, thereby increasing the number of common control 
units needed in a central office. 

Theoretically, if the volume and shape of a piece of magnetic material 
are not changed but the material is laminated with equal-size lamina- 
tions, the effective eddy current conductance is reduced by an amount 
which is inversely proportional to the number of laminations. Also, if 
the cross-sectional area is rectangular, the eddy current conductance is 
further reduced as the ratio of width to thickness is increased. Thus, 
laminating a rectangular cross-section magnetic part reduces the effec- 
tive eddy current conductance to between 1/N and 1/N? of the un- 
laminated value, where N is the number of equal-size laminations. 

Taking advantage of the recent development in annealing and the 
concept of laminations, it was therefore proposed that the 1 per cent 
silicon steel core of the general-purpose relay be replaced with a core 
made up of two equal laminations of S.A.E. 1010 low-carbon steel. The 
effect of this material change on the operation of the relay will be dis- 
cussed from both the practical and theoretical aspects, with a view to 
showing that the change results in a relay equal in performance to those 
produced during the past several years, at a considerable cost saving. 


IV. PRACTICAL ASPECTS OF THE PROPOSAL 


In order to introduce a substitute design for a functional part of the 
relay, the following factors must be considered: 


4.1 Operational Factors 


(1) time characteristics 
(a) electrical operate time 
(b) electrical release time 
(2) magnetic pull vs ampere turns 
(3) heating (watts input vs temperature rise) 
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(4) life 
(a) wear vs number of operations 
(b) wear effect on operate and release characteristics 
(5) core plate tightness (due to staking efficiency of softer 1010 steel 
material) 
(6) corrosion protection (effectiveness of plating along laminated seam 
if the laminations are welded before plating). 


4.2 Manufacturing Factors 


(1) Shape of laminations 
(a) economic considerations (punching properties and tool life) 
(b) assembly considerations (dimensional considerations for spool- 
head and core plate areas) 
(2) cost of material 
(3) loose vs attached laminations 
(a) handling ease 
(b) assembly ease 
(4) method of punching 
(a) single (one at a time) 
(b) double (two at a time, i.e., one on top of the other) 
(5) welding or mating of Jaminations 
(a) before punching 
(b) after punching 
(c) location of welds. 

To have the laminated §.A.E. 1010 steel core accepted for use in the 
relay, the new design must perform as well as the old design with regard 
to all of the operational factors and should have definite advantages with 
regard to the manufacturing factors. In order to obtain the maximum 
improvement in the manufacturing area without affecting the over-all 
relay, it was necessary to introduce the minimum number of changes to 
the structure. As a result, the object of the laminated core proposal was 
to match, as nearly as possible, all of the characteristics of the present 
general-purpose wire spring relay having a silicon steel core with a new 
core of the same physical outside dimensions. 

Analysis of the magnetic properties of §.A.E. 1010 steel annealed by 
the wet forming gas method indicated that sufficient magnetic pull would 
be developed with this material provided the efficiency of the relay’s 
magnetic circuit was maintained. However, as indicated previously and 
analyzed in detail in the next section, laminating the core was necessary 
in order to reduce the eddy current conductance to tolerable levels. To 
be assured that the benefits of laminating the core would always be 
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present, it was at first believed necessary to physically separate the 
laminations by depressions or an insulating film to prevent the flow of 
eddy currents between the laminations. From a manufacturing stand- 
point, it was decided that if this were necessary, it would be more prac- 
tical to depress a large section of one of the laminations instead of using 
an insulating finish. As a result, the first sample cores were made this 
way. For comparison, a standard core is shown in Fig. 2 and laminated 
cores of the first design in Figs. 3 and 4. 

Tig. 4, a side view of the laminations, shows the recessed section in the 
upper lamination to provide an air gap over the greater portion of the 
length of the core. The two laminations are in intimate contact at the 
two ends to provide a low-reluctance path for the magnetic flux to pass 
from the bottom lamination through the upper to the relay armature. 
However, timing tests of various combinations of recessed laminations, 
as well as flat laminations, in relays have shown that it is not necessary 
to create a positive or visual air gap between the parts. Apparently, the 
surface resistance of the laminations duc to normal oxide films is suffi- 
cient to keep the eddy currents of the laminations from combining. 





Fig. 2 — Standard one-piece core. 


X=WELD POINTS (4) 


SLANT 
VIEW 





{ 
TOP BOTTOM VU 
LAMINATION LAMINATION 


Fig. 3 — Laminated core. 
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Fig. 4 — Side view of laminated core. 


In Fig. 3, a slant view of the laminations, four welds are shown at 
points marked (X). The welding was done before magnetic annealing or 
plating of the parts. The welds are located at the front and rear, where 
they have the least effect on the eddy currents in each lamination. The 
welds are proposed only to associate the two laminations punched to- 
gether and to facilitate the assembly of the relay. Relays were assembled 
and tested and satisfactory results obtained when the two laminations 
were not welded together. The laminations in this case were held to- 
gether by the core plate at the front end and the spring clamp at the rear 
end after the relay had been assembled (see Fig. 1). 


V. THEORETICAL ASPECTS OF THE PROPOSAL 


5.1 Symbols 


The following is a list of the symbols used in the theoretical discussions 
presented in the balance of this article. Where variant forms of these 
symbols, as distinguished by subscripts, are used in the text, they are 
defined in connection with the specific use. 

a, — inner surface area of coil 


a2, -— outer surface area of coil 

E+ —applied voltage 

F -—magnetic pull 

G —total equivalent single-turn conductance 

G. — equivalent single-turn conductance of coil — N?2/R 

G. — equivalent single-turn eddy current conductance 

H, — coercive force (oersteds) 

I —steady-state current 

~  -—Z§Anstantaneous current 

K —total thermal conductance 

k, — thermal conductance per unit area of coil inner surface 
ke, —thermal conductance per unit area of coil outer surface 
I, —single-turn inductance — (47/1) or @/NI) 


— number of turns in coil 


z 
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NI —- steady-state ampere turns 


Ni — just operate or just release ampere turns 
NI,— ampere-turn coercive force — (H,l/0.47) 
R  —coil resistance 
t —time 
t; — waiting time 

— ratio of flux at time ¢ to steady-state flux 


v 
W —power — [?R 

® —magnetomotive force —4aNI or Rd 
1 —lJength of magnetic path 

G —reluctance of relay 


®. —reluctance of core 
GR, — initial incremental reluctance 
@ — flux 
¢1 — steady-state flux 
¢” —saturation flux 
oo — residual flux 
69 —ambient temperature 
6 —mean coil temperature. 
5.2 General 


Since the only change proposed in the relay is the core material, the 
armature, contact springs, balance spring and other operating parts will 
be unchanged. The mass and spring forces of these parts control the 
travel time of the relay in both the operate and release directions. The 
change in core material will, because of a different eddy current con- 
ductance, affect only the electrical waiting time of the relay. Therefore, 
the electrical waiting time on the operate and then on the release of the 
relay will be considered first. 


5.2.1 Electrical Waiting Time — Operate 


The electrical waiting time on operate of a relay is defined as the time 
from the application of potential to the relay coil until the magnetic pull 
on the armature equals the back force on the armature and it starts to 
move. During this period the flux development in the structure follows 
the relationship: 


| dp _ 
5-8. + 4G = 0 (1) 


where %, is the steady-state magnetomotive force (mmf) or 4rNI, § 
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is the effective mmf and is equal to Rd, and G is equal to the coil con- 
ductance (G.) plus the eddy current conductance (G.). The reluctance 
(R)is a function of the armature-core air gap (X) and the flux @). With 
the armature at rest against the back stop and using the initial condi- 
tions X; and @1, (1) can be rewritten in the integral form with ¢:Q1 
substituted for §; as follows: 








4nG f° do 
eee 2 
‘Ri Jo di — 2) 
which on integration gives: 
4nrG 1 
i = AL In ee, F (3) 


Since v, the ratio of flux attained at time ¢ to the steady-state flux, can 
be written as the ratio of the just operate ampere turns to the steady- 
state ampere turns Ni/NI, and 47/@, is equal to Z,, the single-turn 
inductance, (3) may be rewritten as follows: 


i= Iy(G, = G.) In (4) 


1 
1 — (Ni/NT) 
which is the general form of the equation for the electrical waiting time 
of a relay on operate. 


5.2.2 Electrical Waiting Time — Release 


The release waiting time of a relay is defined as the time from the 
opening of the coil circuit to the beginning of motion of the armature 
from the operated position. The opening of the circuit results in a de- 
caying magnetic field which is sustained only by eddy currents. The 
release waiting time is described by the same relationship as the operate 
waiting time except that since §, = O with the coil circuit open (1) be- 
comes: 


do 

® + 4cG ao (5) 
The normal-release waiting time of a relay without copper sleeves can 
only be determined approximately because of the variable distribution 
of the magnetic field sustained only by eddy currents (G.). However, if 
the flux decay is retarded by the introduction of a conductance much 
larger than G., such as a copper sleeve or short-circuited coil turns, 
the decaying field is nearly uniform and a relatively close approximation 
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to (5) can be made. Equations thus derived for the slow-release case can 
be used for the approximate analysis of the normal-release time case. 
Then (5) can be written in the integral form: 


$1 do 
= AG if ~ (6) 


where é; is the waiting time for the flux to decay from the steady-state 
value (¢1) to the value (¢) at which the magnetic pull is equal to the 
retractile force. 

Yor reliable and repetitive release times, the steady-state flux (41) 
of a relay should be in the region of flux saturation (¢”). Therefore, re- 
lease times will only be considered from this condition. Since § = Rd 
and the relationship between ¢ and § is in the demagnetization curve, 
the following equation results: 


($” — do) ($ — $0) 
Cee ” 


where ®, is the incremental reluctance when § = 
If (7) is substituted in (6), the expression for release waiting time 


becomes: 
_ 4G 1 
dd. 8 
ae (- wis) ? 8) 


Integration of this equation results in: 


= ae 





1, (9) 
where 
z= o” — do = do 
¢— bo” 
To have a more readily usable relationship for release waiting time, 


it is necessary to obtain an expression for Z in terms of § or 47N2. 
Equation (7) can be rewritten to give the following expression for: 


¢” ae do 
= 1 R,. 10 
ane AgNi 10) 
Substituting the expression for ®, as obtained from (10) in (9), and 
recognizing that with the coil circuit open and no sleeves or short-cir- 
cuited turns the only conductance involved is the eddy current con- 
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ductance (G.), the following expression for release waiting time is ob- 
tained: 





_ Gee” — bo) { NZ _ 1 
= BE (ne —2)- oe 


VI. EVALUATION OF OPERATE AND RELEASE WAITING TIME 


Equations (4) and (11), for operate and release waiting times respec- 
tively, can be evaluated by obtaining values for the variables experi- 
mentally and graphically. In this section the procedures for the establish- 
ment of the relay parameters will be discussed. 

The first data needed are magnetization curves of flux vs ampere turns 
with the armature in the unoperated position for the evaluation of op- 
erate waiting time and in the operated position for release waiting time. 
Typical curves are shown in Figs. 11 through 16 (see Section VIII). 
With measured values of just operate, just release and steady-state cur- 
rent, all of the flux values for (4) and (11) can be read from the curves. 
The inductance per turn (Z;) may be found by drawing a line through 
the origin of the unoperated magnetization curve tangent to the nearly 
flat or linear portion of the curve (see Fig. 5). The slope of the tangent is 
a reliable value for ZL, if the just operate flux falls on the linear portion of 
this curve. 

Since G in (2) and (6) is equal to the sum of G,. and G., the effective 
eddy current conductance G, can be determined reasonably well graphi- 
cally and experimentally by holding the values of the integrals of the two 
equations fixed and making timing measurements as G, , which is equal 


FLUX (¢) —> 





AMPERE TURNS,NI —> 


Fig. 5 — Inductance per turn (Z,) from magnetization curves. 


2916 HE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


R,OHMS (VARIABLE) 






I= CONSTANT RELAY 


ae 


F (CONSTANT) 


E,VOLTS (VARIABLE) 


Fig. 6 — Circuit for varying G, or N?/R by changing R (coil in operate condi- 
tion). 


to N?/R, is varied. A fixed value for either integral can be assured by 
having the relay adjusted so that its steady-state current and either just 
operate or just release current are maintained constant throughout the 
experiment. G. or N?/R can then be varied by changing the resistance 
in series with the coil as shown in Fig. 6 for the operate condition and 
Vig. 7 for the release condition. Since all factors except G, are held con- 
stant, a plot of G. versus waiting time will be linear, and when extrap- 
olated to t = 0 will have a negative intercept on the G, axis equal to 
G. as shown in Fig. 8. 

Values for all of the variables in (4) and (11) were determined for both 
one-piece 1 per cent silicon steel cores and laminated 8.A.E. 1010 steel 
cores and the waiting time for operate and release computed. The com- 
puted values are compared to measured values in a later section. 


VII. EXPERIMENTAL DETERMINATION OF OPERATE AND RELEASE TIMES 


In production, permissible dimensional tolerances of the parts and 
differences in the magnetic characteristics of the cores due to material 
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Fig. 7 — Circuit for varying G. or N?/R by changing F (coil in release condi- 
tion). 
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Fig. 8 — Waiting time vs coil conductance. 


and annealing variations will result in variations in the operational char- 
acteristics of the relays. To evaluate these effects, two sample groups of 
relays with each core material were constructed. Group SP was made 
with minimum-dimension parts having a poor anneal and group LG had 
maximum-dimension parts with a good anneal. Fig. 9 shows the operate 
time of these relays as a function of coil conductance with an unoperated 
air gap of 0.032 inch and with input powers of 2 and 10 watts. It will be 
noted that the laminated S.A.E. 1010 steel core relays are approximately 
1.3 to 3.4 per cent faster than the 1 per cent silicon steel core relays. 

Fig. 10 shows the release times and release pull values for the same 
groups of relays. Here, it is noted that the laminated 8.A.E. 1010 steel 
core relays release from 13 to 27 per cent faster than the 1 per cent silicon 
steel core relays. 


VIII. COMPARISON OF CALCULATED AND MEASURED TIMES 


The measured times shown in Figs. 9 and 10 all include some armature 
movement. However, all of the relays had essentially the same mechani- 
cal adjustments, and the same SP and LG armatures were used on both 
the laminated S.A.E. 1010 steel and 1 per cent silicon steel cores. There- 
fore, it can be assumed that the mechanical armature travel times of the 
relays were essentially the same. As a result, a comparison of like sets of 
relays, 1.e., SP laminated versus SP one-piece, etc., should reflect the 
difference in electrical waiting time of the groups being compared. 

A comparison of the calculated times for the like sets of relays was 
also made using the values of L, G., ¢, 0 , ete., obtained from Figs. 11 
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Fig. 9 — Change in relay operate time due to variation in relay dimensions 
and quality of anneal. Relays had cores of either laminated 1010 steel or 1% silicon 
steel. 


through 16 as described earlier. Tables I and II show the comparison 
between the laminated S.A.E. 1010 core relays and the 1 per cent silicon 
steel core relays as determined by measurement and by calculation. In 
all cases good agreement was found between the measured and calculated 
values. 


IX. COMPARISON OF EDDY CURRENT CONDUCTANCE 


In Tables I and II are listed the values of eddy current conductance 
obtained for the SP and LG groups of relays. 7.20 and 8.25 kilomhos 
respectively were found for the 1 per cent silicon steel cores and 5.55 and 
6.65 kilomhos respectively were found for the laminated 8.A.E. 1010 
steel cores. Since the §.A.E. 1010 steel has approximately twice the con- 
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Fig. 10 — Release pull values (grams) and release times (ms) for relays with 
cores differing in dimension, quality of anneal, and type of steel used. 
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Fig. 11 — Magnetization curves of cores with small dimensions and poor 
anneal (0.006 air gap). 


ductivity of 1 per cent silicon steel, the eddy current conductance of the 
S.A.E. 1010 cores would have been about 14.4 and 16.5 kilomhos for 
the SP and LG groups respectively if the cores had not been laminated. 
Thus laminating reduced the eddy current conductance of the cores by 
the ratios of 5.55/14.4 or 38.5 per cent for the SP group of relays and 
6.65/16.5 or 40.3 per cent for the LG group of relays. Since it was ex- 
pected that the use of two laminations would reduce the effective eddy 
current conductance by a ratio of between 1/N and 1/N? or between 50 


and 35.4 per cent, good agreement with the theoretical analysis is indi- 
cated. 


X. COIL HEATING (POWER INPUT VS TEMPERATURE RISE) 


Another major consideration is the effect of using a laminated core 
and a new material on the dissipation of heat from the relay coil. The 
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Fig. 12 — Magnetization curves of cores with large dimensions and good 
anneal (0.006 air gap). 


allowable mean temperature rise of a relay coil is limited by two factors. 
The first of these is that in normal operation the temperatures should 
not rise to a point that is dangerous to personnel in case of physical 
contact. This limit has been established for many years in the Bell Sys- 
tem at 225°F. The temperature rise in normal operation is a function of 
the duty cycle of the relay and is influenced, therefore, by its circuit 
application. The second limitation on temperature rise — that the relay 
shall not become a fire hazard in case of indefinite energization — is of 
more direct interest from an apparatus standpoint. With the normal 
wire insulations and coil insulating materials used in Bell System relays, 
it has been found that a maximum mean coil temperature of 360°F can 
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Fig. 13 — Magnetization curves of cores with small dimensions and poor 
anneal (0.032 air gap). 


be allowed with essentially no risk of insulation breakdown which would 
produce a fire hazard. It is recognized, though, that prolonged exposure 
of a relay to such a temperature could result in permanent damage. 

The dissipation of heat from a relay coil occurs mainly from the inner 
and outer surfaces by a combination of conduction, convection and 
radiation, with negligible dissipation at the coil ends. Convection and 
radiation are principal factors at the outer surface and conduction 
through the insulation and core is the principal factor at the inner sur- 
face. 

The dissipation of heat is therefore through parallel paths which 
can be represented by the electrical circuit analogy shown in Fig. 17. 
The imposed voltage is equivalent to the temperature difference between 
the coil and ambient, the electrical current is equivalent to the heat 
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Fig. 14 — Magnetization curves of cores with large dimensions and good 
anneal (0.032 air gap). 


flow in the branches, and the electrical conductance is equivalent to the 
heat conductance. As shown in Fig. 17, there is a circuit of two branches: 
one from the coil through the outer core surface to ambient and the 
other from the coil through the inner coil surface and the core to ambient. 

From this analogy it has been found that a good approximation of 
the mean coil temperature can be obtained from the relationships: 


1 
K => kode + 
el a2) 


kia, 


and 


i = ae 6 — 4% ) 
Roe (1 * 390 + 6 ve 
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Fig. 15 — Ampere turns vs operate pull values for cores with small dimensions 
and poor anneal. 
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Fig. 16 — Ampere turns vs operate pull values for cores with large dimensions 
and good anneal. 
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Per Cent Decrease in Operate Times — ¢z (laminated 
- core) vs ¢s (1% silicon core) 
s Core 
B L Ge 2 watts 2 watts 10 Watts 10 watts 
S| uH | Kmho Ge = 25 Ge = 40 Ge = 25 Go = 40 
a Kmho Kmho Kmho Kmho 
> 
s Material Type Calc. | Meas.| Calc. | Meas. | Calc. | Meas.| Calc. | Meas. 
: = 
SP | SAE 1010 | lam. |0.267) 5.55 
SP | 1% silicon] solid |0:264] 7.20{|9-776[8- 370) -576/2.876)3.470)3.470)2 .370)2.17o 
LG | SAE 1010 | lam. |0.298) 6.65 
LG | 1% silicon| solid |0.291| 8.25/|2-3/0|2-876[1 -170/1 .3870)2.376)2.77)3.570)8 37% 
Taste II — Errect or Ustna LAMINATED CORES ON Eppy 
CURRENT CONDUCTANCE AND RELEASE TIMES 
Per Cent Decrease in Release Times —/ 1 
(laminated core) vs ¢s (1% silicon core) 
Core 
Relay Ge 
Dimension Kmho 20 Ni Release 60 Ni Release 
Material Type Calc. Meas. Cal, Meas. 
SP SAE 1010 lam. 5.55 
SP | 1%silicon | solid 7.20 247% | 25% | 27% | 27% 
LG AE 1010 am. 6.65 
LG |1% silicon | solid 8.25 16% | 18% | 19% | 14% 





where R, = the thermal resistance of the core in ‘‘Fahrenheit ohms,” 
Ro = the coil resistance at ambient temperature, 
6) and @ are expressed in degrees Fahrenheit 
and other symbols are as defined earlier. 
The exact thermal conductance of complete relay structures will vary 


Rp =1/Kp ap (COVER) 


(SOURCE OF HEAT) 





R,=1/K,a,(CORE SPACE) 


Fig. 17 — Electrical circuit analogy of dissipation of heat. 
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considerably from unit to unit because of variations in relay assembly, 
such as the tightness of fit of the coil on the core. However, nominal 
values for a1, d2, ki, k2, and FR, have been established for general-pur- 
pose wire spring relays with 1 per cent silicon steel cores. These values 
along with the theoretical difference in R, for laminated S.A.E. 1010 
steel cores can be used to calculate mean coil temperatures. The estab- 
lished mean values are as follows: 


ky = 0.01 watt/F°/in.? 


ke = 0.0055 watt/F°/in2 
ay = 2.32 in? 
a2 = 5.54 in2 


R., = 35 Fahrenheit ohms (solid 1 per cent 
silicon steel core). 


Then with an applied voltage of 100 volts de, a coil resistance of 1451 
ohms, and an ambient temperature of 77°I’, the mean coil temperature 
of a relay with a 1 per cent silicon steel core was calculated as follows: 


kia, = 0.0232 watt/F° 
keae = 0.0304 watt/F° 


1 
= 0.0304 + ——~j_ = 0.0432 watts/T° 


i} 
go's 0.0232 


kya 





Tes caak 6 — Bp ) 
(100)? 


ae 6 — 77 
“TABL 0.0432(@ — 77) (1 = saa) 


390 + 77 
6, = 204°F (1 per cent silicon steel core). 


The mean thermal conductivity of S.A.E. 1010 is 53 watts/C°/em? while 
the mean thermal conductivity of 1 per cent silicon steel is 28 watts/ 
C°/em?, so that they are in a ratio of 53/28 or 1.89. 


30 


a = Teg > 18.5 Fahrenheit ohms 


. R 
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for 8.A.E. 1010 steel cores. With the value of R.; for S.A.E. 1010 steel 
cores and the other constants with the same as above, except that with 
this core the coil resistance was 1443 ohms, the mean coil temperature 
was calculated as follows: 


1 
K, = 0.0304 -+- —————— _ = 0.0467 watt/F° 
18.5 + F935 
and 
(100)? _ Lhe ( 6— 77 ) 
oe do is (ag C0 0 rer 7; 


6, = 195°F CUaminated S.A.E. 1010 steel core). 


For confirmation, heat tests were conducted on sample relays with 
both types of cores using applied voltages of 50 and 100 volts de. Fig. 
18 shows the mean coil temperature as a function of time. With an 
applied potential of 100 volts de the calculated mean coil temperatures 
are 204°F and 195°F for the 1 per cent silicon steel and 8.A.E. 1010 
steel cores respectively, while the measured values are 205.5°F and 
199.5°F in the same order. Thus the measured values are found to be 
in close agreement with the calculated values. Although there is nearly 
a two-to-one ratio between the thermal conductances of the two mate- 
rials, there is only a 3 to 4 per cent difference in the mean coil tempera- 
tures. However, this small difference is in favor of the relays with lam- 
inated §.A.E. 1010 steel cores. 


240 
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Fig. 18 — Mean coil temperature vs time. 
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XI. OPERATING LIFE 


Since 8.A.E. 1010 steel is softer than 1 per cent silicon steel, life tests 
were conducted to determine whether relays with laminated low-carbon 
steel cores have at least as long an operating life as currently manufac- 
tured relays. In operation, the armature stop disks and the hinge spring 
at the heel of the armature wear or pound into the core (see Fig. 19). 
If significant wear occurs at either of these points, it will cause a cor- 
responding reduction in the release current and an increase in release 
time. 

Fig. 20 shows the measured wear at the armature heel and the change 
in release time as a function of the number of operations. It is noted 
that the wear of the laminated §8.A.E. 1010 steel cores is about equal 
to or less than that of the 1 per cent silicon steel cores. This is probably 
due to the lack of abrasiveness of the low-carbon steel as compared to 
the silicon steel after the finish has been worn through. 


XII. CORE PLATE ASSEMBLY 


The core plate of the relay (see Figs. 1 and 19) serves as a back stop 
for the armature, a positioning stop for the fixed contact molded block, 
an aligning fixture for the three core legs and a means of mass adjust- 
ment of the contacts. Therefore, it is essential that the core plate be 
securely fastened in place. The core plate is held in place by staking the 
ends of the two outer legs of the core as shown in Tig. 19. Tests show 
that the pull-off force of the core plate on the laminated low-carbon steel 
cores is approximately twice that of the pull-off force from silicon steel 

FRONT 
SPOOLHEAD 
\ ARMATURE 
aie hd 






HINGE SPRING_~ 
WEAR AREA 


STAK oe \ 
IMPRESSIONS | STOP DISK 


WEAR AREA 


ADJUSTING 
SLOT (2) 


Fig. 19 — Wear areas of stop disks and hinge spring. 
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Fig. 20 — Heel wear and release time vs number of operations. 


cores. The softer S.A.E. 1010 steel is upset more by the staking opera- 
tion than the harder 1 per cent silicon steel. 


XIII. CORROSION PROTECTION 


If the two laminations were to be welded together immediately after 
punching, it was deemed necessary to establish the reliability of corro- 
sion protection, along the laminated seam, obtained with the standard 
zinc-chromate finish on such an assembly. Therefore, a number of lam- 
inated core assemblies of $.A.E. 1010 steel were punched and welded 
together, magnetically annealed and then plated in the normal manner. 
The laminated assemblies, along with standard zinc-chromate plated 
1 per cent silicon steel cores, were subjected to extensive corrosion studies 
under extremes of temperature and humidity. The extent of corrosion 
on both materials was within tolerable limits. 


XIV. MANUFACTURING FACTORS 


The manufacture of a relay with a two-piece laminated core presents 
a number of problems of dimensional control and parts handling which 
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must be overcome before its introduction is economically feasible. Close 
dimensional control of core thickness is necessary in the areas where 
the front coil spoolhead and core plate are mounted, to assure tight and 
stable assemblies (see Figs. 4 and 19). With the one-piece core the thick- 
ness dimensions in these areas are controlled accurately by a squeezing 
operation on the part as the last step of the fabrication. With two sep- 
arate laminations these areas would require closer control of the material 
thickness and the depressed areas, since with two parts fabricated sep- 
arately the thickness tolerances would be additive on assembly. 

A proposal to overcome this problem was suggested (see Fig. 21) 
in which one lamination is undercut so that only the thickness of one 
lamination appears in the critical areas. This would require a redesign of 
the core plate and spoolhead to fit the new core. Tests of sample relays 
with cores of this design revealed an appreciable degradation in pull 
and time characteristics. 

An alternate proposal was suggested whereby two strips of $.A.E. 
1010 steel are welded together at prearranged locations and then punched 
simultaneously as a single part. The welds between the two sheets are 
located so that after punching they appear on the core at the locations 
shown in Fig. 3. With this method of welding and punching, the two 
laminations of a core remain together throughout the fabrication proc- 





, Fig. 21 — SAE 1010 steel laminated cores for general-purpose relays (rejected 
esign). 





RELAY CORES 2931 


ess, and the assembly is flattened and squeezed to size in the same oper- 
ation as is now used with the one-piece core. Fig. 22 shows the two 
laminations without welding in the upper view, and a welded laminated 
core compared to a one-piece core in the lower view. Tig. 23 shows a 
schematic layout of the proposed process of welding the strip and punch- 
ing the cores. 

There is no appreciable difference in the operational quality of these 





Fig. 22 — (a) Two core laminations before welding; (b) a welded laminated 
core compared to a one-piece core. 
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Fig. 23 — Schematic layout of the proposed process of welding the strip and punching the core. 
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relays whether the core laminations are welded or not. Therefore, the 
ultimate design in this respect will be determined by manufacturing 
considerations. 


XV. SUMMARY AND CONCLUSIONS 


(1) Wire spring relays made with laminated 1010 steel cores are at 
least as good as and in some respects better than the present standard 
1 per cent silicon steel core relays. 

(2) Operate times of the proposed laminated 1010 steel core relays 
are in general slightly faster than those of the present one-piece silicon 
steel core relays (1.3-3.4 per cent). 

(3) Release times of the proposed laminated low carbon steel core 
relays are considerably faster than those of the present silicon core 
relays (13-27 per cent). 

(4) Heat studies indicate slightly better heat dissipating qualities in 
the laminated core relays than in the present silicon steel core relays. 

(5) Core plates are tighter on the laminated 1010 steel core relays 
than on the present silicon steel core relays. 

(6) Resistance to corrosion with the present finish (zine chromate) 
is as good on the laminated 1010 steel core design as on the one-piece 
silicon stcel core design. 

(7) Life tests show that the laminated low carbon steel cores are at 
least as resistant to wear as the present silicon steel cores. 

(8) Substantial manufacturing savings can be realized by changing 
to the laminated 1010 steel core for the general-purpose wire spring 
relay. 
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A Model For Mobile Radio Fading Due 
to Building Reflections: Theoretical 
and Experimental Fading 
Waveform Power Spectra 


By JOSEPH F. OSSANNA, Jr. 
(Manuscript received May 20, 1964) 


Fluctuations in received signal amplitude occur during mobile com- 
munications because of the motion of the mobile station through the spatial 
standing-wave pattern resulting from the interaction of direct and reflected 
signals. A model ts presented which permits a theoretical calculation of the 
power spectrum of these fluctuations and satisfactorily predicts the features 
of spectra computed from experimental fading data except for an observed 
rise at low frequencies. 

The model is based on the geometry of the reflections from nearby ran- 
domly placed vertical plane reflectors. Vertical polarization is assumed. 
Both the standing-wave pattern and the Doppler shift view of fading are 
used to obtain nearly identical results. The detailed shape and in particular 
the sharp cutoff frequency of the spectrum are shown to depend crucially on 
the angle a between the direction of vehicle motion and the direction to the 
jixed station. Detailed comparisons are made of theoretical spectra with 
experimental spectra representing a range of the angle a. 

The collection, digitization, calibration, plotting, and digital processing 
to obtain power spectra of actual recorded fading waveforms are described. 


I. INTRODUCTION AND SUMMARY 


The mobile radio fading phenomenon discussed herein is the fluctua- 
tion in the received signal amplitude during mobile communication due 
to the motion of the mobile station through the spatial standing-wave 
pattern resulting from the interaction of direct and reflected signals. 
Knowledge of the statistical behavior of such fading signals can permit 
more meaningful mobile communication system studies and design 
effort. For example, it can aid in the choice and design of automatic 
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gain control systems and systems involving data transmission. The par- 
ticular statistical description of fading to be discussed is the power 
spectrum of the signal amplitude. 

First, this article presents a theoretical model for mobile radio fading 
due to building reflections, which permits a theoretical calculation of the 
power spectrum of the fading waveform. Second, the collection, digital 
processing, and power spectral analysis of actual fading waveforms are 
described. Then detailed comparisons are made between the theoretical 
and experimental spectra. Limitations and extensions of the model are 
explored. 

Historically, the evolution of the model followed a study of the power 
spectra of fading waveforms recorded on one particular street. These 
spectra had an unexpected and interesting shape. The model indicated 
that the shape of the spectrum and particularly the frequency at which 
the power density fell abruptly would depend on the vehicle’s direction 
of travel with respect to the direction toward the fixed station. Subse- 
quent fading waveforms, recorded on other streets having various rela- 
tive directions, produced power spectra which collectively exhibited all 
the features predicted by the model. 


1.1 Characteristics of Experimental Power Spectra 


Fading waveforms were recorded on 13 streets in New Providence, 
N. J. on Sept. 13, 1962, using a carrier frequency of 838 me and a nomi- 
nal vehicle speed of 15 mph. Vertical polarization was used. The fade 
rate corresponding to motion through standing-wave minima a, half- 
wavelength apart is 37.5 cps. After digitization, calibration, and smooth- 
ing, power spectra were computed. Almost all of these spectra exhibited: 
(1) a very sharp cutoff at a frequency somewhere between 20 and 40 
cps, followed, after a drop of between 10 and 20 db, by a 12 db/octave 
fall-off; (2) a narrow peak between 3 and 10 db high just prior to the 
sharp fall; (3) a broader, shorter intermediate peak; and (4) a rise at the 
low-frequency end of 10-15 db. Many spectra exhibited a shelf in the 
frequency range between the sharp fall and the gradual fall-off which 
extended to about twice the sharp-fall frequency. Other sometimes 
subtle features were noted which will be mentioned later. 


1.2 The Model 


The simple model to be described predicts a theoretical power spec- 
trum for the fading waveform having all the features of a corresponding 
experimental spectrum except a low-frequency rise. In particular, the 
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frequency, shape, and size of two peaks, the sharp fall, the following 
shelf, and the subsequent gradual fall-off are satisfactorily predicted. 

The model is based on the geometry of the reflections from nearby 
randomly placed, vertical, plane, good reflectors. Vertical polarization 
is assumed (the use of horizontal polarization is discussed in Section 
XIV). The vehicle is assumed to move through the standing-wave 
pattern caused by the reflections. A virtually identical result obtains if 
the vehicle is assumed to encounter appropriate Doppler shifts for each 
reflected signal. The computed spectrum depends on: (1) the radio 
carrier frequency, (2) the vehicle speed, and (3) the angle a between 
the vehicle direction of motion and the direction to the fixed station. 

If fm is the fade frequency which would be experienced by the vehicle 
moving directly across standing-wave minima spaced \,/2 apart (A. = 
carrier wavelength), the model indicates that the spectrum will peak 
and then fall sharply at some frequency f, between fm/2 and fm and that 
an intermediate peak will occur at a frequency f» — fp . In terms of the 
angle a defined in the previous paragraph, f, is equal to the larger of 
fm sin” a/2 or fm cos’ a/2. As a varies between 0 (or 180°) and 90°, f, 
varies between f and fn/2. 


1.3 Comparison between Experimental and Theoretical Spectra 


Because the angle a varies during a typical data run along any street, 
theoretical spectra were determined by averaging spectra computed 
for sample a’s along the run. The agreement between experimental and 
theoretical spectra is generally quite good. The sharp-fall frequency 
agrees perfectly in almost all cases. Other details are in good agree- 
ment in many cases. The main discrepancy is the absence of a theoretical 
forecast of the rise in the observed spectral density at low frequencies. 
One street which, unlike all the others, had few buildings produced 
experimental spectra only vaguely similar to theoretical expectations. 
Mechanisms not considered in the theoretical model which would con- 
tribute low-frequency energy include: (1) shadowing by buildings, 
(2) variations in and the shadowing of ground reflections, (3) the inter- 
modulation of concurrent reflections and (4) nonrandom reflector 
orientation. 


1.4 Usefulness of Model 


The model in the form offered successfully predicts fading waveform 
power spectra in a suburban residential environment. The possibility 
certainly exists that an extension of the model can be made to work 
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elsewhere. The usefulness of power spectra is not unlimited, and many 
statistical properties of fading cannot easily be deduced from spectra. 
One of the main values of the agreement between these theoretical 
and experimental spectra is its verification of the physical basis of the 
model. 


II. THE MODEL 


We will begin by considering the fading experienced by a mobile 
receiver moving through a standing-wave pattern due to a single reflec- 
tor when the transmission is vertically polarized. Reciprocity will insure 
application of the results to the case where the receiver is fixed and the 
transmitter is moving. First, the fade rate will be related to the vehicle 
trajectory. Then the same result will be obtained using the Doppler 
point of view. Then the relative contribution from reflectors in differ- 
ent directions will be determined. Next, a theoretical spectrum will be 
constructed for the case of many reflectors; its shape will depend strongly 
on the vehicle direction relative to the fixed station. 


III. STANDING-WAVE PATTERN DUE TO A SINGLE REFLECTOR 


If the mobile antenna is a fixed height above the ground, only the 
larger (many wavelengths) vertical plane reflectors in the vicinity of the 
mobile station are of major importance in determining the local standing- 
wave pattern fluctuations. Reflectors of ordinary size which are not in 
the vicinity of either the mobile or fixed stations are of lesser importance 
because their reflected signals will be of smaller amplitude. Reflectors 
near the fixed station can contribute large amplitude reflections, but 
their effect is more that of modifying the directivity pattern of the 
fixed antenna; their effect is to put slow multiplicative trends into the 
standing-wave pattern at the mobile station. Furthermore, fixed station 
antennas are usually mounted above local obstacles that would not only 
reflect but shield some direction. 

We will assume that the transmitting and receiving antennas are 
vertically polarized. Then the local standing-wave pattern due to a 
single vertical plane conducting reflector is as shown in Fig. 1 where: 


g = angle between the direction to the fixed station and the direc- 
tion to the reflector as seen from the mobile antenna, 

a = angle between the direction to the fixed station and the direc- 
tion of vehicle travel, 

6 = angle of incidence at the reflector, 

d = perpendicular distance between null planes, 
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Fig. 1 — Vehicle moving through standing-wave pattern due to single reflector. 


d’ = spacing between null planes observed along direction of mobile 
station travel, 

L length of reflector, 

L’ = length of vehicle path in reflected beam, and 

A. = carrier frequency wavelength. 


The fixed station is assumed to be far enough away to permit taking 
the incident waves on the mobile antenna and on the reflector as parallel. 
The reflector is assumed to be large compared to a wavelength (L > 2.) 
and close enough to the mobile station to neglect divergence of the 
reflected beam. 


3.1 Null Plane Spacing 


Note that a reflector in a direction ¢ must be oriented such that 
8+ (¢/2) = 90° (1) 


for the reflected beam to be directed toward the mobile station (for the 
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angle of reflection to equal the angle of incidence). To determine the 
spacing d between null planes, refer to Fig. 2 and observe that 








a—b=x, (2) 
b/a = cos, (3) 
and 
d/a = sin (g/2). (4) 
Irom (2) and (3), a is found to be 
Ne Ne 
ae a ; (5) 
? 2sin? & 
2 
Then (4) and (5) yield 
d= asnG = — = (6) 
sin = 
2 
or, using (1) 
Xe 
~ 2cos 6" 7) 


This is of course a common result (see Ref. 1, p. 293 ff.). 
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Fig. 2 — Portion of Fig. 1 in vicinity of reflector. 
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3.2 Fading Rate 


Because of its direction of travel, the mobile antenna observes a 
null spacing a’, which from Fig. 3 is 


arr -£)’ (8) 


which by using (6) becomes 
d./2 


d’ ee 
oe _ e\? (9) 
ae Wa (« .) 


which holds for the case where a > g/2. Consideration of various values 
for a and ¢g/2 leads to the general relation 


de /2 


sin 5 sin (« _ e) 


If the vehicle speed is V, the fading frequency is 
, eV gee Ma) 
sin 5 sin ( e) | ; (11) 


Then f’ has the maximum value fn’ = 2V/d., where g = 180° and 


is (10) 
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Fig. 3 — Portion of Fig. 1 in vicinity of receiver. 
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a = 0° or 180°, when the vehicle moves perpendicularly across null 
planes spaced \,/2 apart. For f, = 838 me and V = 15 mph, fm’ = 37.5 
cps. For convenience, we will usually use a normalized fading frequency 
f =f'/fa'. When a # 0° or 180°, f is zero at ¢ = 0, 2a, and 360°, and 
has the maximum values f; = sin? (g/2) at ¢ = a and fr = cos’ (¢/2) 
at ¢ = 180 + a; note that f; + fe = 1. Thus, for a particular a, the 
maximum possible fade rate is finax = max (f,, fe)” and is due to a ve- 
hicle motion either toward or away from a reflector. The minimum 
possible value for fmax occurs when f; = f. = 4, which corresponds to 
a = 90°. The variation of f with ¢ is shown in Tig. 4 for a = 0°, 30°, 
0°, and 90°. 


3.3 Fading Waveform 


If the reflector is perfectly conducting as assumed above, the actual 
waveform observed at the output of an envelope detector in the mobile 
vehicle would be the familiar result of beating two frequencies of equal 
amplitude — a full-wave rectified sine wave. Thus, in addition to the 
fundamental fade rate discussed above, significant harmonics will also 
be present. If the reflector is not perfectly conducting or is only a 
dielectric, minima will occur instead of nulls; the spacing between them 
will remain the same as for the nulls, and the waveform will tend to be 
more nearly sinusoidal. 


IV. THE DOPPLER POINT OF VIEW 


We can instead consider fading as due to the beating within the 
receiver of different carrier frequencies arising from the different Dop- 
pler shifts occurring for the directly incident and reflected waves. The 
carrier frequency observed at the vehicle will in general be 


fo = fe + (v/Xe) (12) 


where f, and i, are the transmitted carrier frequency and wavelength 
and v is the relative velocity of closure between the two stations. Ifrom 
Fig. 1, the observed frequency of the directly incident signal is 





V cosa 
te = fe a ’ (13) 
Xe 
where V is the vehicle speed. 
* Max (a,b,c, ---) = the algebraically largest of the sequence a,b,c, --- . Simi- 


larly, min (a,b,c, ---) = the algebraically smallest of a,b,c, --- . 
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Fig. 4 — Variation of fade frequency f and weighting function W with relative 
vehicle direction a and relative reflector direction ¢. 


The observed frequency of the reflected signal is 


V cos (a — ¢) 


i; = 14 
fe + SE (14) 
The fading rate will then be the beat frequency between f; and f, : 
VE . 
f=fh—-fi= x [eos (a — gy) — cos al] (15) 
Veen 5 
a [sin asin g — cos a(1 — cos ¢)] 
_ V - @ ; eg _ . @g 
= (2 sin 5) sin a COS 5 — cos @ sin | 


22 et ( -£)| 
= 2 T sin & sin 2 5) I: (16) 


After absolute value signs are added to (16) to account for the various 
relative values of a and ¢, the result is identical to (11). 

The Doppler point of view has one important advantage over the 
standing-wave.pattern point of view. It is easier to see what fade rates 
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will occur when more than one reflector is involved; when n reflectors 
are simultaneously effective, n(n + 1)/2 beat frequencies are possible. 


V. THEORETICAL POWER SPECTRUM 


Since the observed fading rate is a function of the parameters a and ¢ 
discussed above, the power spectrum of the fading waveform at the 
output of an envelope detector in the mobile vehicle will evidently be a 
function of time, even if the vehicle speed is constant. In this section, 
we will develop an approximation to the power spectrum of a finite 
duration of fading waveform. We will assume: (1) the vehicle speed is 
constant; (2) the transmission is vertically polarized; (3) an unmodulated 
carrier is being transmitted; (4) the reflectors are large, stationary, 
vertical, plane conductors: and (5) only reflectors in the vicinity of the 
mobile vehicle are important. Other assumptions inherent in the de- 
velopment will be stated when appropriate. 


5.1 More Than One Reflector 


A major step in simplifying the analysis is to assume that, although 
many reflected beams will be encountered by the vehicle during the 
finite run, only one such beam is important at any one time. This elimi- 
nates the necessity of considering beats between reflections. The effect 
of this assumption on the theoretical spectrum will be discussed later. 
Actually, there is a strong tendency for this assumption to be true in a 
suburban residential environment because the houses are well spread out. 


5.2 The Relative Importance of Different Reflectors 


The energy in a particular small frequency band in the finite sample 
of fading waveform will be proportional to the time that frequencies 
in that band are present. The corresponding power spectral density 
will be proportional to the corresponding fraction of the total run time. 
Thus the contribution of a particular reflector to the appropriate fre- 
quency band will be proportional to the time it takes the vehicle to 
cross the reflected beam or, if the vehicle speed is constant, proportional 
to the length of its path through the beam, which is the length L’ in 
Fig. 1. 

If we assume that all the reflectors are the same size (Z in Fig. 1), 
then the contribution of a particular reflector to the power spectrum 
will be proportional to a weighting function W = L’/L. The assump- 
tion of equal-size reflectors is another assumption that has a strong 
tendency to be true in suburban residential areas, where all the houses 
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in a given locale tend to be the same size. From Fig. 3 it is evident that 


W = L’ __ sin (y/2) 


Lana ey - 


Consideration of the various values of a and ¢ yields the general result 
vr 
ao 
Because L’ cannot exceed the total run length Zz , the physical maximum 
value of W is Wimax = L/L. Fig. 4 shows the variation of W with ¢ 
for a = 0°, 80°, 60°, and 90° ; the plots of W, which are in db (10 logiy W) 
are shown directly below corresponding plots of the fade rate f. W has 
the value zero (— © db) at ¢ = 0° (and 360°), except when a = 0 
where W = 0.5 (—3 db) aty = 0°. W = 1 (0 db) ato = 2a. And W 
is truncated to Wimax at ¢ = a and 180° + a; the peaks of W are coin- 
cident with the peaks of the fade rate f. Not only do the reflectors 
directly ahead or behind the vehicle cause the most rapid fades, but 
they are the most important contributors to the power spectrum. 

It is interesting to note that the weighting function W can be arrived 
at in another way. Consider the portion of the f vs ¢ curve between 
gy = 0° and ¢ = a. The small range of reflector directions dg contribut- 
ing to a small frequency band df can be found by differentiating (11) 
to get 


sin 9/2 


wes sin (a — ¢) |" 


(18) 





df/dp = 3 sin (a — 9). (19) 


The projected length of a reflector in a direction g is L, = L cos 6 = 
L sin g/2 (assuming LZ constant). Suppose that the contribution to the 
power in a band df due to the reflectors in a range d¢ is 


de Ly 


P(y)af = Cop 


(20) 


where C' is a constant. Substituting for dy/df and L,/L then gives 


P(f) _C sing/2 


x sin (a — ¢g)’ a1) 


which except fcr the constant is identical to (17). 


5.3 The Theoretical Spectrum Method 


Consider again the plots of fade rate f vs ¢ shown in Fig. 4. For any 
particular f < fmax = max (fi, fo), there are either two or four cor- 
responding values of reflector directions ¢. For each of these g’s the 
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corresponding value of the weighting function can be found from (18) 
and is seen on the W vs ¢ plot directly under the f vs ¢ plot. We will 
assume that the mobile station is under the influence of one reflector at 
a time; this condition has a strong tendency to be true in suburban 
residential areas. Then, if we further assume that all reflector directions 
are equally likely, the power density at the frequency f will be propor- 
tional to the sum of the two or four values of W. 

The basic procedure for generating a theoretical spectrum for com- 
parison with a spectrum computed from experimental data is, if @ is 
constant: 

(7) Select a list of frequencies f = nAf, where Af = (fioia/A1)/ 
(2V/r-) and n = 0,1,2--- ; fia is the folding frequency (Ref. 2, p. 
117 ff.) of the experimental data, M is the number of lags (Ref. 2, p. 
120 ff.) used in computing its spectrum, and (2V/),) is the correspond- 
ing fm’. In other words, select the same frequencies, normalized by di- 
viding by fn’, at which spectral estimates were computed for the ex- 
perimental data. The reason for this matching of frequencies will be 
discussed below. 

(it) For each frequency, determine the two or four reflection direc- 
tions yg, using (11) or Fig. 4. Then for each frequency determine the 
corresponding two or four weighting functions W from (18); each W 
should of course be limited to Wimax . 

(ai) At each frequency, sum the two or four corresponding values 
of the weighting function W to get the spectral power density. 


The solution of (11) in step (27) above can be accomplished by Newton’s 
iteration procedure. Also, the symmetry of the f vs ¢ function about 
¢ = a and 180 + ecan be used; if g: < a isa solution, g. = 180 — 91; 
and, if 2a < ¢; < 180 + a@ is another solution, g; = 360 + 2a — ¢;. 
In the case where a = O (or 180°) an explicit solution for the two ¢’s 
and the sum of the two W’s can be obtained. Setting @ = 0 in (11) gives 


a a aa © (22) 
Ee / J sim 9? 
and the solution y/2 = aresin f?. Setting a = 0 in (18) gives 
W = Saxton . (23) 
2 cos o/2 


Because (23) is symmetrical about g = 7/2, the two W’s are equal, and 
(22) and (23) combine to give the spectral density as 








MOBILE RADIO FADING » 2947 


; 1 
P(f) = min (217 ex EF =) ; (24) 


where the physical limit on W is included. 

Implicit in the procedure thus far is the assumption that fading 
waveform contributed by each reflector is sinusoidal. The resulting 
power spectrum is zero above finax . Actually, the fading waveform due 
to a single reflector has an harmonic content which depends on the 
relative amplitudes of the direct and reflected signals; when one is much 
smaller than the other the fading tends to be sinusoidal, and when they 
are equal the fading waveform is a full-wave rectified sine wave. This 
can be seen by superposing the incident and reflected electric field com- 
ponents; in terms of the z coordinate of Fig. 1, the resultant electric 
field of a vertically polarized wave has the form (see Ref. 1, p. 296) 





|B | = | sin (2 cosa), (25) 


c 


where K is a constant and @ is the angle of incidence. The spectra of the 
experimental data all exhibit fall-offs subsequent to the fall corresponding 
to fmax. The harmonic content of the fading can be included in the 
theoretical spectrum by determining the harmonic power corresponding 
to each original theoretical spectral estimate and adding this power in 
at the corresponding set of harmonic frequencies. Arbitrarily, the coeffi- 
cients for a full-wave rectified sine wave were used to determine the 
relative power at the harmonic frequencies; this will provide a maximum 
of harmonic power. It is an interesting fact that inclusion of harmonic 
power does not significantly alter the shape of the theoretical spectrum 
at frequencies below fimax . 

The final step in generating the theoretical spectrum is to smooth 
it in an appropriate way. Because the spectra computed from experi- 
mental data are estimates of smoothed versions of the true power spectra 
(see Ref. 2), the theoretical spectra should be smoothed in a correspond- 
ing way. Therefore the theoretical spectra to be shown will have been 
smoothed by hanning.” This is the reason for matching the theoretical 
and experimental spectral estimate frequencies. 

Finally, if the relative path angle @ varies during the run, its varia- 
tion can be represented by a weighted list of sample a’s. The spectrum 
for each a can be determined and the resulting spectra averaged. The 
smoothing can be done after averaging. 
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5.4 Theoretical Spectra for Various Constant a’s 


Fig. 5 shows theoretical spectra for a = 0°, 30°, 60°, and 90°, V = 
15 mph, f. = 838 me, and Wimax = 15. The corresponding fm’ = 37.5 
cps. Consider first the curve for a = 60°. The peaks corresponding to 
the relatively heavily weighted frequencies in the vicinity of f; and fo 
(fi’ = 9.4 eps, fo’ = 28.1 cps) are clearly evident. Following f2’, the 
power density falls sharply and levels off abruptly to form a shelf. The 
shelf, which arises primarily from second-harmonic power, peaks around 
56 eps prior to a second sharp fall. Following the second-harmonic shelf 
is one due primarily to third harmonics which ends at about 84 eps. 
Because the points in the fundamental frequency portion of the spectrum 
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Fig. 5 — Theorctical fading waveform power spectrum vs relative vehicle di- 
rection a. Theoretical spectral estimates are 1-cps apart. 
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were computed at 1-cps spacings, the harmonic shelves get increasingly 
jagged-looking at higher frequencies. The dashed line sloping down 
through the shelves is a least squares straight-line fit to the portion of 
the spectrum following the first sharp fall. The falloff line in the a = €0° 
case has a slope of —13.0 db/oct. 

The peaks in the a = 30° case correspond to f;’ = 2.5 eps and fo’ = 
35.0 eps. When a = 0, f;’ = 0 and f2’ = 37.5 eps. And when a = 90° 
the peaks unite at fi’ = fo’ = 18.75 eps. The least square fall-off lines 
have slopes that generally fall between 12-13 db/oct. 


5.5 Theoretical Spectra for a Uniformly Distributed 


Fig. 6 shows the result of averaging the spectra for a’s uniformly 
distributed 0-360° (spectra for a = n2°,n = 0,1, --- , 45, were aver- 
aged). The spectral density is quite flat out to 37.5 eps, where it drops 
abruptly about 16 db to the second-harmonic shelf. The harmonic 
shelves in this case are also quite flat. The least squares fall-off line is 
shown and has a slope of —13.2 db/oct. 


VI. DATA COLLECTION 


The fading waveforms due to vehicle motion were recorded (on FM 
tape with an Ampex F'R100) for 17 runs on 18 different streets (runs on 
some streets were made in both directions) in New Providence, N. J., 
on Sept. 18, 1962. Transmission at 838.032 me was from the mobile 
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Fig. 6 — Average theoretical power spectrum for a uniform distribution of 
relative vehicle direction a. 
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vehicle (a Volkswagen Kombi) traveling nominally 15 mph, to a fixed 
station at the Murray Hill Laboratories. The range was between 1 and 2 
miles and varied little during any run. The duration of the recorded 
waveforms ranged from about 20 to 150 seconds. Values obtained for 
the parameter angle a ranged from 6° to 90°. All of the streets were in 
suburban residential areas except Central Avenue, which serves open 
fields and a few single-story industrial and commercial buildings. The 
weather was clear and dry. 

The vertically polarized transmitting antenna atop the vehicle was a 
stack of 24 coaxial dipoles with a net gain of about 4.5 db. The interac- 
tion with a second similar antenna several wavelengths away is not 
known, but is believed to be small. The receiving antenna was a verti- 
cally polarized 13-element coaxial array mounted atop a rooftop ele- 
vator house. It had about 11 db gain and a 3-db beamwidth of about 
6°. 

A voice channel was recorded simultaneously with the fading signal 
on a second I'M tape channel. This channel carried a running com- 
mentary describing the data and included start- and end-of-data an- 
nouncements. Also recorded on this same channel were tone bursts 
triggered every nominal 0.01 mi by a cam attached to the speedometer 
cable. The exact vehicle speed was ultimately recovered from these 
bursts. 

To permit over-all calibration of the static transfer characteristic 
of the system, calibration levels 3 db apart over a 60-db range were 
recorded both prior and subsequent to the recording of the fading sig- 
nals. The two stations were directly connected by coax for calibration. 
Each level was recorded for a few seconds along with appropriate voice 
announcements. 

A complete set of Visicorder records were then made from the I'M 
magnetic tape of both the fading signals and the tone bursts for a pre- 
liminary examination of the data and for later determination of the 
vehicle speeds. 

The pertinent parameters for data runs whose power spectra are shown 
in this article are given in Table I. The system bandwidth was limited 
by the receiver, which was 3 db down at 310 cps. The angular elevation 
above the horizon of the fixed station as viewed from the mobile station 
was usually between 1° and 3°. 


6.1 Vehicle Speed 


Four timed test runs were made in the vehicle over a fixed, level 
course of 1443 ft., to determine typical speed variations during a run and 
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TaBLeE [— ParaMETers OF REcoRDED Data FOR WHICH 
CoMPARISONS ARE SHOWN BETWEEN THEORETICAL 
AND EXPERIMENTAL SPECTRA * 
































Ay Alpha (deg) , , 

g. 

Case No. Street Speed Smax J, Fig. No. 

mph : cps cps 
min max avg. 

1 Commonwealth] 15.8 | 82.2 | 84.5 | 83.3 22.4 39.5 10 
2 Charnwood 15.8 | 73.0 | 75.8 | 74.4 | 25.5 39.5 11 
3 Whitman 15.2 67.8 | 80.4 | 74.2 26.2 38.0 12 
4 Elkwood 16.2 41.0 | 42.6 ] 41.8 35.4 40.4 13 
5 Ridgeview 15.8 |167.8 |168.4 |168.1 39.2 39.6 14 
6 Ridge 15.9 15.0 | 15.8 | 15.4 38.9 39.7 15 
7 Central 16.0 68.7 | 72.9 | 70.8 27.2 39.9 16 








* All spectra were computed using 5000 sample points (20 sec at 250 points/sec). 


to calibrate the tone burst rate. The nominal speed for each run was 
15 mph = 22 ft./sec. Visicorder recordings were made of the bursts 
from the FM tape to enable counting them and measuring their spacing; 
the Visicorder paper speed was determined to be 1.019 in./sec using a 
10-cps square wave (set by frequency counter). The tone burst rate 
was found to be 51.98 + 0.16 ft. between beginnings of bursts. Using 
this burst rate, the averages and standard deviations of the speed during 
these four runs are shown in Table II. 

The actual average vehicle speed during each data run or part of a 
run was determined from the Visicorder records which have the tone 
bursts plotted alongside the fading signal. Let Ng be the number of 
bursts occurring during a part of the run and Dz (inches) be the corre- 
sponding length of Visicorder paper. The average vehicle speed Sz 
for that part of the run was then computed from Sz = 53.0 Ne/Dr 
ft./sec. 


6.2 Location of Data Runs 


The precise location of each run was carefully marked on a set of 
topographic maps (100 ft = 1 inch) which showed actual street and 
house shapes. Typically, street intersections and poles were used as 
starting and ending points. Except in the case of Whitman Road, 
which is slightly S-shaped, the vehicle was driven in a straight line. 


VII. PATH TRAJECTORY DATA 


The set of topographic maps referred to previously have a common 
coordinate grid. By determining the coordinates of the starting and 
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Tasue IJ — AVERAGES AND STANDARD DEVIATIONS OF SPEEDS 
Durine Four Test Runs 








Test Run Avg. Speed (ft./sec) Std. Dev. (ft./sec) 
1 23.15 0.638 
2 23.08 1.09 
3 23.16 0.54 
4 23 .43 0.41 
Average 23.21 0.67 





ending points of each run, and the coordinates of the fixed station, it is 
possible to compute the vehicle azimuth (path azimuth), the azimuth 
of the direction from the fixed station to the mobile station (fixed azi- 
muth), the angle a between the direction of the vehicle and the direction 
to the fixed station (positive if fixed station is to the left of the vehicle), 
and the range at various points along the run. Except in the case of 
Whitman Road, the end points were connected with a straight line 
which was then divided into 50-ft. intervals (the last interval usually 
extending past the original end point). The value of a was then tabu- 
lated for the distances 50 ft. (n = 0,1,2,---) along each run. It 
should be noted that even with a straight path @ varies because of the 
finite distance to the fixed station. 

In the case of Whitman Road, where the path trajectory follows the 
shape of the road and is not straight, a larger map (50 ft. = 1 in.) was 
used which showed the actual azimuth variation along the street. 


VIII. DIGITAL PROCESSING 


Following digitization of the fading data, the calibration, plotting, 
filtering and spectral analysis were accomplished on an IBM 7094. 
Many computer programs and subroutines were written for these 
purposes as well as for such auxiliary purposes as calibration curve 
fitting, magnetic tape searching (subroutines that can conveniently 
retrieve requested data pieces), spectra equalizing and plotting, and 
vehicle path angle determination. An available set of time series process- 
ing subroutines® was extensively used; this set included subroutines 
for tapering and detrending data, and for computing auto- and cross- 
covariances and Fourier transforms. A large arsenal of subroutines was 
eventually amassed, and writing a program for some particular task 
became the relatively simple job of writing a program to call appropriate 
subprograms. 
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IX. INITIAL DATA PROCESSING 


9.1 Digitization 


Both the fading waveforms and the recorded fixed calibration levels 
were digitized on an analog-to-digital converter within the Laboratories,’ 
using 11 bits/sample and sampling at 500 cps. The procedure for digitiz- 
ing consisted of playing back the analog tape, listening to the voice- 
channel announcements, and manually triggering the digitizer on and 
off at the indicated times. Approximately 2-second intervals of each 
calibration level were digitized. The signals were filtered prior to sam- 
pling by a passive filter which was 3 db down at 250 cps, 10 db down at 
300 cps, and subsequently fell 36 db/octave. The folding or Nyquist 
frequency (see Ref. 2, p. 30 ff.) of 500/2 = 250 cps was chosen to safely 
contain the expected power spectra. 


9.2 Microfilm Plotting 


The digital data was read into the 7094 and completely plotted on 
microfilm on a peripheral General Dynamics 4020 microfilm printer. 
This provided a good visual record of the raw data as well as a check 
on the digitizing process. A computer subroutine was developed which 
generates a long continuous plot down the length of the microfilm. 
Such plots were produced for monitoring after every step in processing 
or transcribing the data. The comparative ease with which large quanti- 
ties of digital data can be monitored by viewing microfilm considerably 
reduces the chance of the accidental processing and use of data con- 
taining errors. The 17 runs of recorded fading waveforms, which totaled 
over 920 seconds, yielded over 460,000 data points when digitized at 
500 cps. When plotted at 480 points per 35 mm frame, the complete 
data comprising 960 frames could be viewed in detail on a roll film viewer 
in about an hour. ig. 7, which exhibits a typical data section, was traced 
from a print from one frame of microfilm. 


9.3 Calibration 


The communication system nonlinearities, including that of the 
linear-to-log converter used during analog recording, had to be removed 
to obtain true signal amplitude. The before and after (the data) sequences 
of calibration level records were read on the 7094, and each record was 
averaged to remove noise and obtain a calibration point. Any system 
net drift during original data recording or during digitization would 
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make the before and after curves different. Fortunately, they were 
quite similar and they were averaged to obtain the adopted calibration 
curve. A suitable function was then fitted, using a least squares criterion, 
to the list of calibration points. The adopted calibration function is 


Y = —60.255 + 0.8282 (X — 170.6)? — 0.01614 (X — 170.6) 
+ 8.474-10-§ (X — 170.6)? — 9.658-10-!° (X — 170.6), 


where X is the digital sample value and Y is the true signal in relative 
db. This function, which has a maximum error of 0.47 db near Y = —3 
db and rms error of 0.21 db, is shown in Fig. 8 plotted along with the 
original calibration points. Input values outside expected limits of 
X = 170.60 and 4041.06 were clipped to these values. The calibration 
program kept a statistical history of any clipped regions. The signal 
amplitude is then exp (0.11512926Y). 


xX. INITIAL ANALYSIS AND SECOND-STAGE PROCESSING 


10.1 Preliminary Power Spectra 


These were computed for several pieces of the data to determine 
whether any smoothing and decimating [Ref. 2, pp. 129-135] was 
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Fig. 7 — Typical section of data. 
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Fig. 8 — Original calibration points and fitted curve from (26). 


necessary or desirable. The power spectra computation will be dis- 
cussed later. Being sure to pick runs expected to have the widest band 
spectra, it was determined that the significant portions of the spectra 
were safely below one-half the folding frequency of 250 eps. Decimation 
by two (retaining every other point), which would reduce the folding 
frequency to 125 eps, would be safe and would reduce computation 
time. Suitable smoothing prior to decimating can also remove or reduce 
the 120-cps and higher hum peaks which were observed. Removing this 
hum is not essential for the spectral analysis, but doing so makes the 
data more suitable for level crossing analysis. 


10.2 Smoothing and Decimating 


The decimation of data retaining every Jth point, symbolically indi- 
cated by F, , multiplies the folding frequency by 1/.J. To prevent power, 
including noise power and hum power, at frequencies above the new 
lower folding frequency from folding over and appearing below this 
frequency, the data must be smoothed (or filtered) before decimation 
(see Ref. 2, pp. 129-135). 

The most economical type of smoothing in digital analysis is to com- 
pute straight running means of L consecutive values. Usually, simple 
sums which differ from the means by a factor are used to obviate division 
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by L. This smoothing, symbolically indicated by S, , is then 


4 


Y= Dd) X&, (27) 


j=i—L4+l 


and has the power transfer function 





. Lrf P 
sin oj 
Si(f) =|— +], (28) 
sin as 
2fr 
where f is the frequency and f, is the folding frequency. S,(f) has 
periodic transmission nulls at (f/fr) = 2n/L, where n = 1, 2,-:-. 


Because the loss between nulls is usually not too great, a common proce- 
dure is to smooth twice with S, followed by Sz4: (indicated by Sz4:S,); 
the second smoothing will have nulls tending to fall between those of 
the first. 

The processing chosen for the present data was F’.S;/4, — smoothing 
by threes and fours and then retaining every other point. The new 
folding frequency is 250/2 = 125 eps. The smoothing loss is plotted in 
Fig. 9 as a function of the fraction of the new folding frequency; the 
folded portion of the loss curve is also shown. Maximum loss occurs at 
125 (near 120), 167 (near 180), and 250 cps. The loss peak at 125 will 
make it impossible to obtain accurate spectral estimates close to the 
folding frequency, but this is not serious. Spectra can now be computed 
with the same resolution, stability and duration of data, for one-fourth 
of the computer time. 


XI. POWER SPECTRA 


The method employed in determining power spectral estimates is that 
described by Blackman and Tukey;? another good reference is Ref. 5. 


11.1 Spectral Computation Parameters 


The spectra to be shown were computed using 5000 points (20 seconds 
at 250 samples/second). The mean and a least squares linear trend was 
removed from the data sections used, and the first and last 5 per cent of 
each section were raised cosine tapered to zero. The autocovariances 
(mean lagged products) were determined for 100 lags (i.e. for lags of 
ndt, where n = 0, 1,--- , 100, and At is the sample spacing). A finite 
cosine transform of the autocovariances then provides spectral estimates 
125/100 = 1.25 eps apart from zero to the folding frequency. The spectra 
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Fig. 9 — Smoothing loss for S;S,; vs fraction of folding frequency. 


were smoothed by hanning (see Ref. 2, p. 98). Under these conditions, 
each estimate has a 90 per cent chance of being within about a 2-db 
range of the true spectrum. Or, the difference between the estimate and 
the true spectra has a variance of about (0.3 db)’. 


11.2 Computed Spectra 


Almost fifty different spectra were computed from the data collected. 
A representative set of these are plotted (circles) on Figs. 10-16, where 
they may be compared to corresponding theoretical spectra. The com- 
parison will be discussed later. All of the plotted spectra were equalized 
for the smoothing loss before plotting. It may be noted that most of 
the curves are not plotted beyond some frequency between 80-100 eps. 
The smoothing and decimating (/'2S3S4) produced an infinite-loss notch 
in the spectrum at 125 cps. When the spectrum is subsequently computed 
this hole is filled in by computation noise. The plot was automatically 
ended at the frequency where equalization of this noise started to pro- 
duce a meaningless result. Even so, the last few points plotted are 
inaccurate and tend to be lower than they should be. 
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Fig. 10— Experimental fading waveform spectrum compared with corresponding 
theoretical spectrum. Relative amplitudes are arbitrary; see discussion in text. 


11.3 Spectral Density Curve Shapes 


The spectra all exhibit significant power density out to some frequency 
between about 20 to 40 cps, where the density falls sharply between 10 
and 15 db and then more gradually at about 12 db/oct. Many show a 
distinct shelf between the sharp-fall frequency and the subsequent slow 
fall-off. The shelf generally ends with a noticeable sharp drop at a fre- 
quency about twice the earlier sharp-fall frequency. The shelf in Fig. 14 
has a noticeable peak prior to its fall at about 75 eps. The peak at 60 
cps in all the plots is power supply hum. Another significant common 
feature is the relatively narrow peak immediately preceding the sharp- 
fall frequency. Many of the spectra exhibit a noticeable broad peak 
below the narrow one. All of the spectra rise 10 to 15 db at low frequen- 
cies. 


XII. COMPARISON OF THEORETICAL AND EXPERIMENTAL SPECTRA 


To compute a theoretical spectrum corresponding to a particular 
experimental spectrum, it is necessary to know the carrier frequency 
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Fig. 11— Experimental fading waveform spectrum compared with corresponding 
theoretical spectrum. Relative amplitudes are arbitrary; see discussion in text. 


and vehicle speed in order to compute f, , and to know the variation 
in the parameter angle a. For the present comparisons, the carrier fre- 
quency is 838.032 me, and the average vehicle speed and corresponding 
fm are shown in Table I. The range of a’s represented in the comparisons 
is from about 12° to 83°; remember that the spectrum for a = 90° + x 
is the same as one for a = 90° — wz. The weighting function W was 
limited to Wmax = 15 (compatible with the typical run length of 450 
feet and typical house side length of 30 feet). 

For each theoretical spectrum, the corresponding list of a’s was used. 
Spectra were computed for each a and the final spectrum was the hanned 
weighted average. Harmonic power was included in each a’s spectra 
before averaging. A value of Wimax = 15 was used in all cases. Table I 
also lists the fmax corresponding to the value of a occurring during run 
that is nearest to 0° or 180°. As an example of how the list of a’s was 
used, consider Commonwealth Avenue (case 1). These data are actually 
points 1-5000 (250 pts/sec) of a longer piece. The vehicle speed accord- 
ing to Table I was 15.8 mph; thus the run was (20 sec) (15.8 mph) (22 
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Fig. 12— Experimental fading waveform spectrum compared with corresponding 
theoretical spectrum. Relative amplitudes are arbitrary; see discussion in text. 


ft./sec)/(15 mph) = 4638 ft. in length. The angles a, are values com- 
puted for distances d, = 50 n ft., where n = 0, 1, 2, --- and may be 
considered to represent the distance ranges d, - 25 ft. Thus the relative 
weights for the a, and the corresponding spectra are wi = 0.5, we to 
wy = 1.0, and wio = 0.76; the latter is (463 — 25 — 8.50)/50. When 
the data section does not begin at sample 1, the distance between sample 
1 and the starting sample must be determined using the proper average 
vehicle speed for that interval. 

The change in a during the data section is small enough in many cases 
to permit using the average a to compute the spectrum. For example, 
during the run of case 4 the angle @ varies only from 41.0° to 42.6°; a 
spectrum computed from the average value of about 41.8° differs little 
from one determined by averaging. In other cases — Whitman Road 
for example — the spectrum determined by averaging has much broader 
peaks than one corresponding to the average a. All the theoretical 
spectra to be shown were determined by averaging, whether this was 
necessary or not. 
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Fig. 18 — Experimental fading waveform spectrum compared with corresponding 
theoretical spectrum. Relative amplitudes are arbitrary; see discussion in text. 


12.1 Comparing Theoretical and Experimental Curves 


Figs. 10-16 show theoretical spectra (solid curves) superimposed on 
experimental spectra (circles); Table I lists the data sections involved 
and gives corresponding figure numbers. The only thing arbitrary in 
comparing the theoretical and experimental spectra is their relative 
amplitude. Thus the theoretical curve has been shifted vertically to 
produce some sort of fit. In all cases a transition is shown from the basic 
theoretical spectrum to a fall-off line fitted by least squares to the portion 
of the theoretical spectrum above fax ; this fall-off typically has a slope 
of —12 to —13 db/oct. The last few points of each experimental curve 
are not very accurate and tend to be low, as previously discussed. 


12.2 General Results of the Comparison 


Before reading further, the reader should make a superficial scan of 
Figs. 10-16. The agreement between the theoretical and experimental 
spectra is generally quite good. The main discrepancy is that the ob- 
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Fig. 14— Experimental fading waveform spectrum compared with corresponding 
theoretical spectrum. Relative amplitudes are arbitrary; see discussion in text. 


served rise in spectral density at low frequencies is not predicted by the 
theory. The sharp-fall frequency agrees very well in almost all cases. 
The peak prior to this sharp fall fits well in many cases. In cases where 
intermediate peaks are predicted (a not too close to zero), the experi- 
mental spectra usually exhibit them. The second harmonic shelf is 
well formed in many cases. The following are some comments on specific 
comparisons: 


12.2.1 Case 1; Fig. 10 


This street had an average a of about 83°. The two peaks have nearly 
merged and have formed a double peak which the experimental spectrum 
exhibits in agreement. The second-harmonic shelf is higher than pre- 
dicted and ends somewhat early; more will be said about this later. 


12.2.2 Case 2; Fig. 11 


Here a averages about 74°. The upper peak and the sharp fall agree 
well. The intermediate peaks are in only fair agreement. Because of 
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Fig. 15— Experimental fading waveform spectrum compared with corresponding 
theoretical spectrum. Relative amplitudes are arbitrary; see discussion in text. 


the absence of a second-harmonic shelf in the experimental spectrum, 
the theoretical fall-off line is plotted beginning with its intersection 
with the first fall. A slight rise above this line occurs out to nearly 40 
eps; a second-harmonic shelf would have to extend to about 50 eps. 


12.2.3 Case 3; Fig. 12 


The comparison here is similar to that discussed for case 2. This street, 
however, has an a which varies between 67.8° and 80.4° and averages 
74.7°. The comparatively broader theoretical and experimental peaks 
may be noted. 


12.2.4 Case 4; Fig. 13 


This street has a averaging about 42°. The intermediate peak at 
about 5 eps is discernible. The sharp fall occurs at about the right fre- 
quency but is not as steep as expected. The second-harmonic shelf is not 
noticeable. The theoretical fall-off line is picked up at its intersection 
with the theoretical shelf. 
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Fig. 16 — Experimental fading waveform spectrum compared with corresponding 
theoretical spectrum. Relative amplitudes are arbitrary; see discussion in text. 


12.2.5 Cases 6 and 6; Figs. 14. and 15 


These streets have a small a (about 12° and 15° respectively) and the 
intermediate peak is too close to zero frequency (below 1 eps) to show up 
with the present resolution. The main peak and the sharp fall agree 
exceptionally well and the second harmonic shelves are well exhibited. 
Furthermore, the predicted peak on the shelf appears in the experi- 
mental spectrum of Fig. 15 (and to a lesser extent in Fig. 14). 


12.2.6 Case 7; Fig. 16 


The agreement between the theoretical and experimental curves is 
in this case relatively poor. This street, however, is not in a suburban 
residential area, and has only a few low industrial buildings spaced well 
back from the curb. The section of the street corresponding to Fig. 16 
had an average of a of about 71°. It was observed that, if experimental 
spectra were computed for other portions of the street with different 
a’s, these spectra were quite similar if their ripples were ignored. 
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12.3 Additional Observations 


It has been observed that the streets having the best agreement be- 
tween experimental and theoretical spectra in the vicinity of the higher 
peak, sharp fall, and harmonic shelf are those with @ near 0° or 180°. 
A good reason why this is not too surprising is offered in Section XIII 
under a discussion of nonrandom reflector orientation. 

A cause for the small but discernible drop in Figs. 10, 11, 12 and 16 
at the frequency corresponding to f,,’ is offered in Section XIII under a 
discussion of simultaneous reflections. 

When the harmonic content was included in the theoretical spectra 
it was assumed that the reflectors were perfect conductors. The obser- 
vation of second-harmonic shelves at the predicted amplitude level in 
many cases indicates that the assumption was reasonable. The use of an 
aluminum foil vapor barrier integral with outside wall insulation is 
common in current house construction and may explain their good 
reflectivity. 


XII. LIMITATIONS OF THE MODEL 


In the preceding section it was seen that a major deficiency of the 
theoretical model is its failure to forecast the rise in spectral density at 
the low-frequency end of the spectrum. Some of the mechanisms that 
can contribute low-frequency energy are discussed in the following 
paragraphs. 


13.1 Shadowing by Buildings 


The shadowing of the direct signal by buildings introduces into the 
fading waveform a low-frequency multiplicative function (likely with 
some harmonic content) with a fundamental spectrum probably not 
extending much beyond about 14 eps (houses spaced 80-100 ft. apart 
and a vehicle speed of about 15 mph). The resulting fading waveform 
spectrum would be the convolution of the spectrum without shadowing 
with that of the low-frequency function. Such a low-frequency multi- 
plicative function was observable in some portions of microfilm plots 
of recording fading waveforms. It is not likely that this effect explains 
the entire extent and shape of the low-frequency rise. 


13.2 Ground Reflections 


These cause standing-wave patterns which normally vary only in a 
vertical direction. But as the vehicle moves the point on the ground 
causing the ground reflection moves and the reflectivity will vary. This 
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and the shadowing of ground reflections by buildings introduce low- 
frequency variations in the direct signal. 


13.3 Simultaneous Reflections 


An assumption inherent in the construction of the theoretical spectra 
was that the mobile vehicle was under the influence of one reflector at 
a time. The simultaneous presence of more than one reflected signal 
will give rise to additional beat frequencies in the fading waveform 
between the reflected signals (see Section IV). Here the Doppler point 
of view is useful. From (14) the Doppler-shifted reflected signal is seen 
to lie in the frequency range f, = f. & V/d. ; thus, if all possible reflec- 
tions are always present, the radio frequency spectrum would have a 
bandwidth 2V/d. = fm’. The Doppler shifted direct signal has from 
(13) a frequency of f; = fe + (V/X-.) cos a. The shape of the spectrum 
is not symmetrical about f,. This shape can be obtained by picking 
frequencies f, between f, — V/A, and f. + V/d., solving (14) for the 
two values of ¢ corresponding to each frequency, and then summing the 
two corresponding values of the weighting function IV obtained from 
(18). The result is a spectrum having a broad minimum at f, = fi = 
fe + (V/d-) cos a and peaks at f, = f; + V/d.. Thus, as a varies from 
a = 0 to 90° to 180°, the direct signal f; moves from the upper-frequency 
end of the spectrum to the center and to the lower end. Fig. 17 shows 
the radio-frequency spectrum as a function of a (the peaks appear sharp 
because no smoothing has been applied). If the direct signal is large in 
amplitude compared to all the reflected signals, the spectrum of the 
envelope would essentially be that obtained previously (Section V), 
except for the lack of harmonic content. If the direct signal is ignored, 
the spectrum of the envelope would be the convolution of the radio- 
frequency spectrum with itself (see Ref. 6, Chap. 12); this spectrum, 
which would vary from a maximum at zero frequency to zero at 2V/X. , 
would be virtually independent of a. The fact that the convolution 
would carry the spectra only out to 2V/). suggests that beats between 
reflections may be responsible for the partial filling in of the second- 
harmonic shelf when a is not near 0 or 180°. This effect is noticeable to 
various degrees in Figs. 10, 11, 12 and 16, where a perceptible drop 
occurs at fm independent of any termination of the second-harmonic 
shelf at 2fmax. Similarly, this effect probably decreased the steepness of 
the observed steep fall in Fig. 13. It remains a fact that in a suburban 
residential environment, the spacing of houses is such that there is a 
very strong tendency for a mobile vehicle to experience only one domi- 
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Fig. 17 — Radio-frequency power spectra for a range of relative vehicle direc- 
tion a. 
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nant reflection at a time. There certainly is some continual overlap of 
reflected beams and this may in part be responsible for the low-frequency 
rise in the experimental spectra. 


13.4 Nonrandom Reflector Orientation 


Houses are generally built with their larger flat sides parallel and 
perpendicular to the street. It has been shown from (11) that a zero 
fade rate obtains when ¢ = 0°, 2a, and 180°; low-frequency fades obtain 
when ¢ is near these values. The weighting function W is zero for g = 0° 
and 180° (except when a = 0° or 180°), but is unity at g = 2a. Clearly, 
it is the reflectors whose flat sides are roughly parallel to the direction 
of vehicle travel that contribute the low-frequency fade rates; each such 
reflector makes its contribution when the vehicle position is such that 
its g & 2a. Conventional house orientation obviously increases the 
supply of reflectors causing low-frequency fades above that under ran- 
dom conditions. It is felt that this effect may be the most important 
reason for the low-frequency rise in experimental fading spectra. 

The same nonrandomness will affect the experimentally observed 
spectral peaks. These peaks occur when ¢ = a and a + 180°; the corre- 
sponding required reflector orientations, relative to the direction a, 
are a/2 and 90 — a/2. When a is small or near 180°, these required 
relative orientations are near 0° and 90° —i.e., near parallel and per- 
pendicular to the direction of travel. When a@ is near 90°, the required 
relative orientations are both near 45°. Thus observed peaks for a’s 
near 90° may be relatively subdued by the relative absence of required 
reflectors. This effect has been observed in several comparisons of 
theoretical and experimental spectra. 


13.5 Other Low-Frequency Effects 


Nonuniformity of the fixed station antenna pattern and reflectors in 
the vicinity of the fixed antenna can produce some very low-frequency 
variations in the standing wave pattern. Motion of the fixed antenna 
and trees due to wind are additional sources of low frequencies in the 
standing wave pattern. 


XIV. ADDITIONAL TOPICS 


14.1 Moving Reflectors 


All of the preceding discussion was concerned with fixed reflecting 
objects. The experimental data were taken with the streets devoid of 
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other moving vehicles. What effect will the motion of other vehicles have 
on the spectrum? Let us limit this discussion to vehicles moving in the 
same or opposite direction on the same street as the mobile station; 
let Vi and V2 be the speeds of the mobile station and moving reflector 
respectively with a positive V. corresponding to closure between the 
vehicles. The moving reflector encounters and reflects a frequency 


V2cos a 


29 
——s (29) 


fo = fe — 
where f., Ac, and @ have their previous meanings, and V2 has a com- 
ponent away from the fixed station. The mobile station encounters a 
reflected beam of frequency 


Vit 


de ee) 


fi = fet 
where dz corresponds to fz, and encounters a Doppler shifted direct 
signal of frequency f; given by (13). The beat frequency f between f; 
and f, is then 


V V. V 
Pah fia ft they, - aS" 
Vit Ve _ (Vit V2) cosa ‘ 
SS (31) 
~ Ott — cosa), 


where c is the velocity of light. This result corresponds to (16) with 
gy = a. Thus an oncoming vehicle with V2 = Vi could double the maxi- 
mum observed fade rate. 


14.2 Horizontal Polarization 


In terms of the x and z coordinates of Fig. 1, the electric field com- 
ponents in the reflected beam region will take the form (see Ref. 1, p. 
295) 





EL, = jK cos 6 sin (= cos a) exp (-i 2zx sin @) (32) 


EK, = K sin 6 cos (= cos a) exp (i ora sin) ; (33) 





where K is a constant and @ is the angle of incidence. If a nondirectional 
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receiving antenna is used, such as is approximated by a ‘‘turnstile”’ 
consisting of two perpendicular dipoles connected together by a 2,/4 
stub, the received signal can be shown to be proportional to 


K sin (2 cos § — a) | (34) 


Cc 


|Z |= 








irrespective of the angular orientation of the turnstile. This result has 
the same form as (25) except for the spatial phase shift ¢. Thus the use 
of horizontal polarization together with the assumed antenna produces 
an effective standing-wave pattern that is identical to that for vertical 
polarization except for the spatial translation. Thus the fading situation 
would also be identical. 


14.3 Field Component Diversity 


Equations (32) and (33) show that H#, is a maximum where EL, is 
zero and conversely. Suppose the two dipoles of the turnstile antenna 
are not connected together with a stub but are offered simultaneously 
to the receiver. Then, if the receiver electronically switched to the dipole 
offering the greater signal, the receiver would in many cases never experi- 
ence a null. However, when the direction to the reflector ¢ approaches 
180°, 6 approaches 0° and the component £, becomes smaller and van- 
ishes; likewise, /, vanishes when ¢ approaches 0. Thus any diversity 
scheme dependent on choosing between , and #, would be most suc- 
cessful near g = 90° (@ = 45°) and unsuccessful near ¢ = 0° or 180° 
(@ = 90° or 0°). 

At every point in the reflected beam region it would be possible to 
rotate a dipole to a position where a maximum signal is picked up. This 
is not possible only when ¢ = 0° or 180° exactly. It is possible, therefore, 
for a mobile dipole that is mechanically or electronically rotated con- 
tinuously to receive a maximum signal, to reduce the amplitude of the 
fading due to vehicle motion, if horizontal polarization is employed. 
If the angular position of a mobile single dipole is fixed, the fundamental 
fading rate experienced is still the same function of a and ¢ as before, 
except that the amplitude of the fading will vary because of the direc- 
tivity of the dipole. For example, if g = 90° and the dipole is physically 
oriented perpendicular to the fixed station, the mobile dipole may be 
translated anywhere in the reflected beam region without any fading. 
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Digital Data Signal Space Diagrams 


By J. R. DAVEY 
(Manuscript received April 20, 1964) 


Signal space diagrams are described which show the pattern of amplitude 
and phase variation for several kinds of modulated carrier signals commonly 
used in digital data transmission. Such diagrams illustrate important 
similarities and differences among the various modulation methods. Oscil- 
loscope pictures of actual data signal patterns are presented, and it 1s shown 
that these patterns can be used to detect the presence of amplitude and delay 
distortions in the transmission channel. 


I. INTRODUCTION 


Many different kinds of modulated carrier signals are being used in 
digital communication systems. All these various data signals can be 
expressed in the general form A(t) cos [w.é + @(é)] where a carrier cos wet 
is varied in amplitude by A(t) and in phase by ¢(¢). The various modu- 
lation methods impart different patterns of amplitude and phase varia- 
tion. The characteristic pattern of a given modulation method can be 
portrayed by a polar plot of A and ¢ in which the angular reference is 
wt. This type of plot will be referred to as a “signal space diagram.” 

Signal space diagrams will be described for several kinds of carrier 
modulation. Only synchronous signals consisting of sequences of evenly 
spaced symbols will be considered. In each case the received symbols 
can be thought of as a sequence of carrier pulses or bursts, each with an 
envelope shape determined by the channel characteristic. In such a view 
the differences among the types of modulation are due to the number of 
pulse amplitudes and phases which are used, the particular phase se- 
quences used, and the spacing between pulses. In order to obtain the 
simplest signal space diagram it is desirable to choose the reference w, 
as the center of the received pulse spectrum so that the phase variation 
of a single isolated pulse will be minimized. With a symmetrical pulse 
spectrum and a linear phase characteristic this reduces the pattern of 
a single isolated pulse to a radial line. When the phase of the carrier 
varies from pulse to pulse and the pulses overlap, more complicated 
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patterns are formed. By transmitting a random sequence, a pattern of 
all the permitted amplitude and phase variations of that particular type 
of modulation is obtained. 

Space diagrams will be presented for a number of commonly used data 
signals. In these examples the pulse envelope has been taken to have a 
raised-cosine shape in time in order to make the diagrams consist of 
circles and straight lines. This is a close approximation to the case of 
a raised-cosine pulse spectrum which is often typical in actual data 
systems. The pulse spacing, 7’, for the double-sideband examples is 
equal to the reciprocal of the half-amplitude width of the pulse spectrum. 
This corresponds to the maximum rate which avoids intersymbol inter- 
ference as described by Nyquist. Pulse spacings of 7/2 are used in FM 
and VSB methods where special conditions are established to avoid 
intersymbol interference. 


II, SIGNAL SPACE DIAGRAMS FOR VARIOUS TYPES OF MODULATION 


2.1 Amplitude Modulation 


The first example is for on-off AM where mark is represented by a 
pulse and space by no pulse. The carrier phase remains the same from 
pulse to pulse, thus resulting in a straight-line pattern as shown in Fig. 
1. The signal positions at the mid-symbol sampling instants are indi- 
cated by points M7 and S which are separated by the pulse amplitude A. 
The shape of the pulses is indicated at the right in the figure. 

The diagram for suppressed-carrier AM or two-phase signals is shown 
in l'ig. 2. In this case a pulse is sent for both mark and space, but the 
carrier phase for space is opposite to that for mark. Again the diagram 
is a straight line, but the mark and space sampling points are separated 
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Fig. 1 — On-off AM. 


SIGNAL SPACE DIAGRAMS 2975 


ae a 
7 M 
eo 
| t 
A/2 Ase 
| yy 
| 
HALF ENVELOPE 
tae! 
M = MARK 
S = SPACE 
S 


Fig. 2 — Suppressed-earrier AM or binary PM with 0° and 180° changes. 


by twice the distance of the pulse amplitude. The positive half of the 
envelope and the carrier phase for each pulse are indicated at the right. 
A minimum separation of A is obtained with a pulse amplitude of A/2. 
As compared with the on-off case of Fig. 1, the same margin against 
noise is obtained with 3 db less average power and 6 db less peak power. 


2.2 Phase Modulation 


Binary phase modulation where the choice of phase change is 0° 
or 180° results in the diagram of Fig. 2, as noted above. Alternatively, 
the choice of phase change can be +90°. This has the advantage of 
symmetry and less amplitude variation. The diagram for this type of 
signal is a square, as shown in Fig. 3. The signal can move in either 
direction around the square and at the centers of the symbols is at one 
of the corners. Since there is always a 90° change between symbols, the 
signal alternates between corners marked with dots and those marked 
with circles. For a peak signal of A/2 there is a minimum separation 
between dot positions or circle positions of A, as was the case in Fig. 2. 
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Fig. 3 — Binary PM with +90° changes. 
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Note that the separation of interest 1s that between alternative choices 
for a given pulse rather than that between successive pulses. 

Diagrams for two cases of quaternary phase modulation are presented. 
When the phase change between symbols is 0°, +90° or 180° the diagram 
is a square with diagonals as shown in Trig. 4. The signal can progress 
around the square in either direction, go across a diagonal or remain at 
one corner with no restrictions. The four possible positions at the centers 
of the symbols are indicated by dots. For a minimum separation, A, 
between states the pulse peak becomes A/+/2. This indicates that for 
this quaternary system to have the same noise margin per decision as 
the two-phase signal of Fig. 2 the power must be increased 3 db. This 
type of signal is equivalent to the sum of two AM suppressed-carricr 
signals at quadrature phase. 

When phase changes of +45° or +135° are used between symbols 
there are eight possible phases for the pulses. The possible positions of 
the signal vector at the symbol centers are shown as dots and small 
circles on the diagram of Fig. 5. There is always a phase change between 
symbols, and the signal must alternate between dot positions and circle 
positions. With a peak pulse amplitude of A/+/2 the minimum separa- 
tion between dots or between circles is again A, as in the previous case. 


2.3 Vestigial Sideband 


It is assumed that the pulse spectrum for vestigial sideband has the 
same raised cosine shape used in the previous examples. It is also assumed 
that the pulse rate is twice the Nyquist rate for double-sideband opera- 
tion and that the pulses originate from the modulation of a suppressed 
carrier higher in frequency than midband by an amount equal to one 
quarter of the pulse rate, as indicated in Fig. 6. This results in a phase 
change between adjacent pulses of +90°. As shown in Fig. 6, the pulses 
overlap to the extent that at the peak of one pulse the adjacent pulses 
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Fig. 4 — Quadrature AM or PM with 0°, +90°, or 180° changes. 
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Fig. 5 — PM with +45° or +135° changes. 


are each at half amplitude. This severe interference is at quadrature 
phase to the wanted pulse and is eliminated by the use of coherent de- 
tection. The signal phase at the center of a symbol is not affected if the 
two adjacent pulses are of opposing quadrature phases but is perturbed 
+45° if the adjacent pulses are of the same quadrature phase. For 
example, on the diagram of Fig. 6 the center of a marking symbol can 
occur at any of the three dot positions at the top of the diagram de- 
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Fig. 6 — VSB: raised cosine pulse spectrum showing location of VSB carrier. 
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pending on the adjacent symbols as indicated. The phase of the coherent 
carrier used for detection advances around the diagram by 90° each 
symbol in the point sequence 1, 2, 3, 4. A continuous marking signal 
consequently follows this same sequence. A continuous spacing signal 
likewise advances 90° during each symbol but remains opposite in phase 
to the coherent reference. 

For continuous mark-space alternations each symbol pulse is retarded 
by 90° from the preceding pulse, and the signal moves around the circle 
in the opposite direction from steady mark or space. The signal always 
alternates between dot and small circle points. The corners of the square 
portion of the diagram are both dot and circle points, and the signal 
may rest at such a point continuously and represent a MMSSMMSS 
sequence. All changes in direction of rotation about the diagram occur 
at the corners of the square; otherwise, the only restriction is for the 
alternation of the dot and small circle positions. Here again, with a peak 
signal of A/+/2, a minimum separation of A between mark and space 
dots or mark and space circles is obtained. Thus the speed is doubled at 
a cost of 3 db more power, as in the case of quaternary phase modulation. 
Note that the individual pulse amplitudes are A/2, as for the two-phase 
case, but that the pulse spacing is halved. For vestigial sideband opera- 
tion these pulses are sent serially, while for the quaternary phase case 
of Fig. 4 the pulses can be considered to be of amplitude A/2 sent two 
at a time. 


2.4 Frequency Modulation 


The binary rectangular wave frequency modulation case to be pre- 
sented here is the ideal one where the bit rate is equal to the frequency 
shift between mark and space. For a continuous mark or space signal 
this results in the signal changing phase 180° between successive symbols. 
Again it is assumed that the signal is shaped to give a raised cosine 
pulse spectrum. Such an I'M signal can be resolved into two components, 
a two-phase signal carrying the binary information and a quadrature 
component consisting of steady mark and space as indicated by the 
vector diagram of Fig. 7. This quadrature component can be con- 
sidered to consist of alternating +90° carrier pulses located between the 
0° and 180° pulses carrying the information. The diagram for such an 
FM signal is shown in Fig. 7. A continuous mark condition (lower 
frequency) causes the signal to move around the circle clockwise. A 
continuous space causes a counterclockwise rotation. At the center of 
the symbols the signal is at either point A or B. A frequency transition 
causes the signal to swing out to one of the points ‘‘x”’ and reverse the 
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Fig. 7 — FM. 


direction of rotation. For continuous reversals the signal swings back 
and forth through point A or B along a horizontal line. For such a se- 
quence of reversals the phase swing is +45°. 

Although the steady mark and space frequency components which 
impart the horizontal component of motion in the diagram carry no 
information, they do permit the detection of the signal on a frequency 
basis. The mark and space conditions are indicated by the direction of 
rotation at points A and B. The quadrature component of the signal 
represents half of the total power. Consequently an FM signal requires 
twice the power of a two-phase signal to produce the same minimum 
separation of the points A and B. The two-phase component of the FM 
wave can be detected by a coherent carrier to determine whether the 
signal is at point A or B. It will be seen, however, that this leads to a 
polarity ambiguity because of the nature of the encoding. Reversals of 
either phase can be represented by the signal being at point A or at 
point B for successive symbols. A change from point A to point B indi- 
cates no transition of the information wave. 


2.5 Duobinary Frequency Modulation 


The duobinary technique developed by Lender is a means of doubling 
the rate of sending binary information. The data are first differentially 
encoded so that a transition is made for a space symbol and no transi- 
tion for a mark symbol. The resulting double-speed binary signal is then 
passed through a frequency shift channel of the type just described for 
ordinary binary operation with no change in frequency shift or channel 
shaping. The signal can change phase a maximum of +90° during these 
half-length intervals. This results in both the in-phase and quadrature 
pulses carrying information. The diagram of Fig. 7 applies in part, but 
because of the double rate we are interested in more points of the pat- 
tern. For instance, for steady mark the signal moves around the circle 
in either direction and the receiver samples the signal not only at points 
A and B but also at points C and D as shown in Fig. 8. The occurrence 
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Fig. 8 — Duobinary FM. 


of a space symbol causes a frequency transition and the signal leaves the 
circle and reverses direction at one of the points labeled S. If there are 
two successive space symbols causing two frequency transitions, the 
signal pauses at an § point for one symbol interval and then continues 
on in the same direction of rotation. An odd number of successive space 
symbols leads to a reversal of rotation while an even number does not. 
The signal can thus proceed around the circle clockwise or counter- 
clockwise or pause at one of the $ points. The rotating conditions repre- 
sent the high- and low-frequency states while the pausing represents 
the midband frequency. When the signal is detected on a frequency 
basis, a three-level baseband output is obtained, with the outer levels 
representing mark and the center level space. 

The complete duobinary diagram of Tig. 8 is seen to be the same as 
that of Tig. 6 for a vestigial sideband signal. This indicates that the two. 
kinds of line signals are of the same form although the encoding is 
different. Experimental verification of this identity has been demon- 
strated by transmitting a vestigial sideband signal to an FM receiver 
and obtaining a three-level baseband signal such as received in duobinary 
I'M. Fig. 9 shows a photograph of the received eye pattern. 


Ill. OSCILLOSCOPE PRESENTATION OF SIGNAL SPACE DIAGRAMS 


Signal space diagrams of actual data signals can be displayed by 
coherently detecting both the in-phase and the quadrature components 
with respect to a midband reference frequency and applying them to 
the X and Y deflection circuits of an oscilloscope. Such an arrangement 
was constructed in the laboratory and used to obtain the signal pattern 
photographs shown in Fig. 10. Three kinds of signals are shown, (a) 
binary I'M, (b) quaternary PM, (ce) binary VSB. These were all voice- 
band data signals within a band centered near 1800 cps. Appropriate 
filters were used to shape the pulse spectrum closely to a raised cosine. 

To appraise the possible value of such signal patterns as a measure of 
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Fig. 9 — Oscilloscope picture of three-level eye obtained by receiving a VSB 
signal on a 202B FM receiver. 


signal quality, the effects of amplitude slope and delay distortion were 
observed. Examples of the results are shown by the oscilloscope pictures 
of Fig. 11. The simulated line distortion characteristics which produced 
these patterns are given in Fig. 12. The effect of amplitude slope is 
readily apparent for FM and VSB, where portions of the transmitted 
sequence result in the signal resting at the high-loss end of the band. 
This accounts for the smaller inner circular portion of the patterns. 
In the case of PM the pattern is changed but not at the mid-symbol 
sampling points. The effect of high-end delay distortion shows up as a 
rotation of one part of the pattern with respect to others. This is readily 
seen in the PM examples, where the portion of the pattern formed by 
repeated phase advances is rotated with respect to the portion formed 
by repeated phase retardations. 


IV. CONCLUDING REMARKS 


Signal space diagrams have been described for a number of commonly 
used data signals. These diagrams are useful in comparing data signals 





Fig. 10 — Oscilloscope pictures of signal space patterns for a 63-bit pseudo- 
random sequence: (a) binary FM, 1000 bits/sec, (b) quaternary PM, 2000 bits/sec, 
(c) binary VSB, 2400 bits/sec. 
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Fig. 11 — Oscilloscope pictures of signal space patterns showing effect of 
amplitude slope and envelope delay distortion. 


on a common basis without regard to specific detection techniques. 
Similarities and differences are revealed which may not otherwise be 
apparent and various possible detection methods can be visualized. The 
margin against noise with ideal detection methods is also indicated by 
the spatial separation of the sampling points. The signal power is indi- 
cated by the pulse amplitude and repetition rate. or example, the fore- 
going diagrams illustrate that binary FM, quaternary PM and binary 
VSB signals all give the same margin against noise for a given trans- 
mitted power. The binary I'M system, however, operates at half the 
speed of the other two for a given bandwidth. It has also been shown 
that a duobinary I'M signal has the same pattern as a binary VSB 
signal. The relative simplicity afforded by FM detection of such a signal 
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Fig. 12 — Amplitude and delay distortion characteristics used to distort the 
signal patterns shown in Fig. 11. 


as against coherent detection is accomplished at a loss of approximately 
6 db in margin against noise. 

Considerable information about the nature of the channel charac- 
teristic is also indicated by the signal diagrams. The use of signal dia- 
grams as an indication of signal quality is primarily limited, however, to 
the laboratory. The required synchronization with the midband fre- 
quency and symbol rate of the signals to be observed tends to make the 
method unsuitable for field measurements. 
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Using Digit Statistics to Word-Frame 
PCM Signals 


By J. R. GRAY and J. W. PAN 
(Manuscript received June 5, 1964) 


Framing of PCM signals can be accomplished by statistical means. For 
signal samples whose probability distribution tends to be concentrated at 
the center of the coding range, the second digit of the Gray code generated has 
a probability of mostly 1’s. This information can be used to frame PCM 
words. Three circuits are proposed that test this probability. Reliability and 
reframe time for each circuit are obtained either analytically or expert- 
mentally. The first circutt uses a pair of racing counters: one counts 0’s in 
the second digit and the other 0’s in the third digit of the Gray code. When the 
system is in-frame, the first counter seldom reaches full count before the 
second, whereas during out-of-frame either counter can reach full count 
first with equal probability. The second circuit uses a reversible counter 
which advances on a 0 and retards on a 1. When connected to the second 
digit of the Gray code, the preponderance of 1’s will keep the counter at or 
near zero count; when connected to any other digit, where the probability of 
a 1 ts at most 0.5, the counter will reach full count in a finite time. The 
therd circuit uses an RC integrator in place of the reversible counter: each 
O of the second digit generates a pulse to charge the capacitor and each 1 
permits the accumulated charge on the capacitor to decay. The action is 
similar to that of the reversible counter but ts difficult to analyze. Expert- 
mental framing performance is given for this circutt. 


I. INTRODUCTION 


When a signal is transmitted by PCM, the receiver must be able to 
group the serial pulse train into code words before it can properly re- 
cover the original signal. This process is called “framing.” It is also 
called “word synchronization,” as distinguished from bit synehroniza- 
tion where the time base of the individual pulses is sought. When the 
pulse train contains several PCM signals multiplexed together, there is 
also the task of multiplex framing or frame synchronization whereby 
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the individual channels must be identified. Word synchronization can 
be derived from frame synchronization if the words are always arranged 
in a definite order within a multiplex frame; otherwise, word synchroniza- 
tion is acquired independently. This article will consider only the 
problem of word synchronization, hereafter simply called ‘‘framing.’’ 

Framing is ordinarily accomplished by using supplementary framing 
pulses inserted among the information-bearing pulses at predetermined 
intervals. The receiver will then find these framing pulses by searching 
and testing for the unique pattern of these pulses. If the framing pulses 
are inserted between every word, a substantial loss of channel capacity 
will result; on the other hand, if framing pulses are inserted only oc- 
casionally, the PCM words will not be uniformly spaced, which is 
inconvenient for a sampled-data system. When the PCM signal contains 
known redundancies, it is possible to accomplish framing without the 
use of supplementary pulses. The signal is then said to be framed 
“statistically.””? The receiver now searches for the word grouping which 
will yield the expected statistics for the signal. A simple example of such 
a statistic is the intelligibility of voice. Voice transmitted by PCM is 
intelligible only when the PCM words are grouped correctly. Other 
criteria, easier to instrument than intelligibility, are available. Most 
signals have amplitude distributions other than the uniform distribution 
or have frequency spectra other than the flat spectrum. Both of these 
properties will be altered when framing is incorrect. One of the easiest 
statistics to measure is the average occurrence of 1’s and 0’s in the code 
words. Measurement of this statistic for the case of a linear coder operat- 
ing on a Gaussian signal source will be the main theme of this article. 
The next section will elaborate on the digit probabilities, followed by 
descriptions and analyses of framing circuits which acquire framing by 
comparing the probabilities of 1’s and 0’s in the second digit of the 
Gray code. 


II, PROPERTIES OF THE GRAY CODE 


If the amplitude of the signal before PCM encoding is centrally dis- 
tributed — Gaussian, for example — and the Gray code is used to convert 
this signal into PCM, then the individual digits of each code word will 
not have equal probability of being either a 1 or a 0. This fact can be 
demonstrated by observing the Gray code assignments illustrated in 
Fig. 1. Because the signal amplitudes are centrally distributed, the center 
codes will be used more frequently than the codes at the extremes; the 
second digit, being a 1 for the center codes, will thus be dominated by 
1’s. It should be noted that this redundancy is the result of a linear coder 
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Fig. 1 — Gray code digit assignments. 


operating on a Gaussian source. If a more efficient digitizer is used for 
this source, as for example (1) a nonlinear coder or (2) a linear coder 
followed by a digital processor to produce variable length codes or 
block codes, then this redundancy can be removed. The amount of 
redundancy in question is approximately one bit. Efficient coding would 
therefore exclude the use of statistical framing. 

Fig. 2 illustrates the probabilities of 1’s for all the digits; we can see 
that the probabilities of each digit being a 1 obey the following inequali- 
ties: 

P(D; = 1) < P(Ds = 1) < +--+ < P(D, = 1) < P(D2 = 1) (1) 
or, equivalently, the probabilities of each digit being a 0 conform to 

P(D; = 0) > P(Di = 0) > -:: > P(D, = 0) > P(D, = 0). (2) 


Any out-of-frame condition is represented by a cyclic permutation of 
the digits so that one of the inequality signs in (1) will be reversed and 
similarly for (2). Any circuit which examines the validity of (1) or (2) 
is therefore a framing detector. A few such circuits will be listed here. 
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Fig. 2— Gray code digit probabilities. 


(1) Racing counters. In this scheme two counters are connected as 
shown in Fig. 3. When either counter reaches the full count of N, both 
counters are reset to zero. Now if the upper counter is connected in 
such a way that its count is advanced for every 0 in digit 2 and the lower 
counter is similarly connected for digit 3, then according to (2) the lower 
counter will reach full count and reset both counters most of the time. 
However, if the signal is out-of-frame, the counters will be actually 
counting the 0’s of the digit pairs 3-4, 4-5, --- or 1-2, and according to 
(2) the upper counter will now be able to reach full count and reset both 
counters much more frequently. The reset signal from the upper counter 
can thus be used as an out-of-frame signal. The probability of a false 
out-of-frame signal can be made small by increasing N, the size of the 
counters. 

(2) Reversible counters. A single reversible counter, shown in Fig. 4, 
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Fig. 3 — Racing counters. 
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Fig. 4 — Reversible counter. 


can also be used to detect the framing status. The count is increased by 
a 0 and reduced by a 1. When digit 2 is connected to this counter, the 
preponderance of 1’s will keep the counter at or near the zero-count 
state and prevent it from reaching full count. When the receiver goes 
out of frame, this counter will be controlled by pulses of some other 
digit which, as can be seen from Fig. 2, has at least 50 per cent zeros; 
therefore full count will be reached within a finite time. Framing can be 
accomplished by searching for a word grouping such that the counter 
does not reach full count in a certain time interval. 

(8) RC circuit. If a random pulse train is connected to an RC circuit, 
shown in Fig. 5, then the presence of a pulse will charge the capacitor 
and the absence of a pulse will permit the accumulated charge on the 
capacitor to discharge somewhat. The process is similar to that of the 
reversible counter, except that the charge and discharge rate is now a 
function of the accumulated charge. A threshold circuit monitoring the 
voltage on the capacitor can be used to indicate the framing status. A 
pulse train derived from the received signal such that each pulse indi- 
cates a 0 and each space indicates a 1 in the second digit of the Gray 
code is used as an input to the RC circuit. When the receiver is in frame, 
the pulse pattern at the input to the RC circuit will be sufficiently 
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Fig. 5 — Framing with RC circuit. 


2990 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


sparse so that the accumulated charge will result in an output voltage 
that seldom builds up to the threshold. However, an out-of-frame con- 
dition will result in at least 50 per cent pulses present at the input, and 
the output of the RC circuit will reach threshold in a finite time. 


III. FRAMING CIRCUIT CHARACTERIZATION 


Two figures of merit are commonly used to characterize framing circuit 
performance, (1) misframe rate and (2) reframe time. Misframe rate is 
measured in terms of the probability that the circuit will indicate an 
out-of-frame condition when in fact the receiver is in frame. Reframe 
time is characterized by the probability distribution of the time re- 
quired for the receiver to achieve correct framing; this includes the time 
taken to detect the out-of-frame condition. In a conventional framing 
circuit, wherein a known framing pulse pattern is monitored, misframe 
rate and reframe time are sensitive only to the error rate of the trans- 
mission medium. Performance is degraded due to masking of the fram- 
ing pulses by noise. With statistical framing, performance is more de- 
pendent on signal statistics. Let the probability of a 0 in digit 2 be 0.05 
at the transmitter; with an error rate of 10 per cent, the probability of 
a 0 will increase to about 0.14, which is still different enough from 0.5 to 
keep the circuit in frame. The signal itself, of course, will hardly be 
usable at this error rate. On the other hand, a significant change in signal 
statistics at the transmitter may cause a collapse of framing. Care must 
therefore be exercised when the performance of statistical framing 
circuits is to be compared with that of conventional circuits. 

To evaluate the misframe rate and the reframe time of the statistical 
framing circuits, the response of these circuits to random inputs must 
be determined. Unfortunately, the statistical properties of the transient 
response of analog circuits such as the RC circuit excited by a random 
signal have not yet been completely solved. Therefore analytical results 
for framing schemes using only digital counters will be presented here; 
even with these circuits the results are approximate. 

An experimental approach is used to determine the performance of 
the framing scheme using RC circuits. The instrumentation proves to 
be rather simple and some results will be given. 


IV. ANALYSIS OF THE RACING COUNTERS 


To lend some physical meaning to the analytical results, the analysis 
will be accompanied by numerical results for a typical application, 
namely, transmission of a mastergroup of telephone channels by PCM. 
A mastergroup carries 600 voice-grade channels frequency-multiplexed 


DIGIT STATISTICS 2991 


together, and its amplitude distribution is very close to Gaussian if the 
signal load is predominantly message service.! With normal busy hour 
loading the rms value of the signal is approximately 14 of the system 
overload voltage. Under extreme conditions the rms may rise to 14 of 
the overload voltage. These figures will be used to calculate the per- 
formances of the framing circuits. A nominal sampling rate of 6 < 10° 
samples per second is assumed for the mastergroup. This rate will be 
used to translate misframe rate into misframe interval, the mean time 
between misframes. 

We can consider the two racing counters as a sequential machine havy- 
ing (N + 1)’ possible states. In Fig. 6 the (V + 1)” states are depicted 
in a square array A; each of its elements a;; represents a state where 
the upper counter has count 7 and the lower counter j. From a;; transi- 
tion is possible to 3 adjacent states @isi,;, Qisi,j41, OF Qi,j41 UpON re- 
ceiving as inputs 01, 00, or 10 respectively. In this notation the first 
digit represents the input to the upper counter and the second digit 
the input to the lower. Since the counters count only 0’s, an input of 11 
will not advance the counters and the state will remain at a;;. Starting 
from the initial state aoo, the problems are (a) to find the probability 
of reaching the bottom row when digits 2 and 3 are connected to the 
counters (this yields the misframe rate) and (b) to find the probability 
distribution of the time required to reach either the bottom row or the 
right-hand column when other pairs of digits are connected to the 
counters; this leads to the distribution of reframe time when the result- 
ing distributions are convolved. 

A convenient technique for finding these probabilities is to use signal 
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Fig. 6 — State diagram for racing counters. 
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flow graphs.” Using « = e° as the time delay operator, the transitions 
indicated in Fig. 6 are as follows: 


_  @P(01) 
CORES g RC) 
_ #P(00) 
diagonal g = T— PUD (3) 
. _  «P(10) 
and to the right r= 1 — «P(il) 7 


The denominator [1 — xP(11)] is due to self-loops at each state when 
neither counter advances. In principle, this flow graph can be solved for 
the transmission from the initial state to either the bottom row or the 
right-hand column as rational functions of the delay operator x. From 
these rational functions the total probability of reaching the bottom row 
can be calculated by letting « = 1, and the probability distribution of 
the waiting time can be obtained by a power series expansion of the 
rational functions. However, in a practical situation with counters 
counting up to 16, the calculations become extremely involved, and even 
with 20 decimal digits round-off errors become excessive. Approximations 
are therefore used to estimate the misframe rate and the framing time. 

To calculate the average misframe rate, the substitution x = 1 can 
be made before solving the flow graph of Fig. 6. This reduces complexity 
considerably and one can calculate the probability of reaching the bot- 
tom row before the right-hand column. Information about time delay 
is lost and must be estimated independently. 

The flow graph can be solved by observing that 


Q(4j) = dQ — 1,7) + 9Q@ — 19-1) + rQUij—1) (4) 
for 
1sisSN-1 and lsjsN-1 
where Q(7,7) is the probability that the state a,; is reached at any time 
starting from ao. The d, g, and r are now numerical quantities calcu- 
lated from (3) with 2 = 1. The above iteration formula is valid for all 


states except the border states of the array A. To complete the picture 
we have 


Q(00) = 1 (5) 
since do 1s the initial state, and going straight down 


Q%,0) =dQa—10) 157EN. (6) 
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To the right we have 
Q(0,j) = 7rQ(0j-1) 18575 N (7) 
For the bottom row we have 
Q(N,j) = dQ(N—-1j)+9QN-1j-1) LsjSN-1 (8) 


and the rightmost column 


QGN) =9Qt¢-1N-1)+rQaN-1) 1st1SN-1 (9) 
and, finally, the lower right state has probability 


since it can be reached only by way of ay_1,vy-1. The special treatment 
given the bottom row and right-hand column is necessary because they 
are the end states; from here we start anew at doo. 

The probability of reaching the bottom row is the sum 


U= 2, aN) (11) 


which is the probability of an output pulse from the upper counter be- 
fore the lower counter reaches count N. This is the probability of a false 
out-of-frame signal when digits 2 and 3 are connected to the counters. 
The recurrence formulas are valid for signals that are independent with 
respect to the past, so that d, g, and r are the same for all states. Statis- 
tical dependence of the two digit inputs is considered in their joint 
probabilities. This iterative procedure has been carried out, and some 
numerical results are presented below. 

Assuming a Gaussian distributed input signal the joint probabilities 
of digits 2 and 3 can be determined for normal loading with an rms input 
at 14 of the system overload and for extreme loading with an rms input 
at 1% of the overload. The various probabilities are shown in Table I: 


TABLE ] — PROBABILITIES OF Digits 2 AND 3 


01 00 10 11 
RMS 14 overload 0.0428 0.0026 0.6826 0.2720 
RMS 14 overload 0.1092 0.0244 0.5468 0.3196 





substituting these numbers into (3), we have for x = 1 the transition 
probabilities shown in Table IT. 
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Taste I] — TRANSITION PROBABILITIES 








Transition Dore Pune To heen 
RMS \% overload 0.0588 0.0037 0.9875 
RMS ¥% overload 0.1605 0.0359 0.8036 


The strong tendency to go to the right is quite evident here. The 
probabilities of reaching the bottom row before the right-hand column 
can be calculated from these data using the iteration formulas developed 
above. To translate these probabilities into mean time between misframes 
we proceed as follows. When the signal is in-frame, the lower counter 
almost always attains full count before the upper. For counters of size 
N, the lower counter resets both counters on the average of every N/p 
PCM words, where p is the probability of a 0 in digit 3. The mean time 
between misframes is then N/pU. The results are shown graphically in 
Fig. 7 for various counter sizes. At normal loading and N = 16, the 
mean time between misframes is 1.2 X 10” words which, at a sampling 
rate of 6 X 10° per second, amounts to 2 X 10° seconds or a little more 
than 2 days. When the rms signal is increased to 14 of overload, this 
mean time deteriorates rapidly to fractions of a second, so that the 
counter size has to be more than 32 to insure adequate reliability under 
severe overload conditions. 

To complete the picture on the racing counters, the framing time will 
be estimated. During search for the correct framing we observe that the 
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Fig. 7 — Reliability of the racing counters. 
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upper counter will be advanced, with 0’s occurring with probability at 
least 16, and that it will be able to reach full count without first being 
reset by the lower with probability at least 144. Thus with M digits in 
each PCM word and assuming the worst case of searching through all 
M — 1 positions, the counters will be reset on the average of 2(M — 1) 
times. Each reset requires on the average of 2N words to either the upper 
or lower counter. A conservative estimate of the average framing time 
for the worst case is therefore 4N(M — 1) words. 

As mentioned earlier, the exact distribution of the framing time is 
difficult to obtain; however, the variance of this distribution can be 
estimated. The framing time distribution can be considered as a com- 
pound distribution, where the number of times n either counter reaches 
full count during framing is governed by one distribution and the wait- 
ing time ¢ for each reset is governed by another distribution. It is known 
that such a distribution has mean E(n)HE(t) and Variance E'(n) Var(t) 
+ Var (n)E’(t) The distribution of the number of times either 
counter reaches full count before the upper counter reaches full count 
M — 1 times is governed by the negative binomial distribution.” With 
the upper counter having probability 14 of reaching full count, n has 
average 2(M — 1) as mentioned before and variance 2(M — 1). The 
waiting time for each reset is similarly governed by the negative bi- 
nomial distribution. With probability 14 of receiving a 0, the waiting 
time t has mean 2N and variance 2N. The variance of the framing time 
is therefore 2(1Z — 1)(2N) + 2(M — 1)(2N)’; for large N this is ap- 
proximately 8(1Z — 1)N’. 

For a 9-digit PCM system J = 9, and if we use N = 82, the average 
framing time for a sampling rate F, = 6 X 10° per second is 


8(M —-1)N 4X8 X 82 


F, = “6 xX 10° = 171 USEC 
the standard deviation is 
24 ; 
[8(7 — 1)N]}?_ (8 X 8)? X 82 =e daae 


F, 7 6 xX 108 


Since the distribution is the result of many convolutions, it can be ap- 
proximated by a normal distribution; with this assumption we can use 
three standard deviations as the confidence limit and estimate the maxi- 
mum framing time as 300 ysec. During out-of-frame conditions the upper 


* See Ref. 3, p. 253. Actually the negative binomial distribution governs the 
number of times the lower counter reaches full count. This average is M — 1; 
the total average waiting time is therefore (M — 1) + (WM — 1) = 2(M@ — 1). 
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counter actually receives 0’s with probability greater than 14, so that 
the estimates are conservative. 


Vv. ANALYSIS OF THE REVERSIBLE COUNTER 


The use of a reversible counter allows greater reliability without re- 
sorting to large-capacity counters as is necessary for the racing counters. 
The analysis is also simpler, since only one counter is involved. The flow 
graph for a reversible counter is shown in Fig. 8. The probability of a 0 
which increases the count is p, and g = 1 — 7p is the probability of a 1 
which decreases the count. The count cannot go below zero. The gain 
of the graph for any counter size N can be obtained by standard tech- 


qz 
() px. px px pz pr = px 
OY NO OY OO 
1 2 3 ae cy 
qz qz qx qn N-2 qx 


Fig. 8 — Flow graph for reversible counter. 
niques. The result can be expressed conveniently in the form of a re- 
cursion formula for the denominator polynomial 
Dy(x) = Dy-(x) — pqx Dy—2(«) 
where 
D(x) = 1 and D(x) = 1 — qe. 


The numerator is simply Ny(x) = p*x”. Some representative results 
are 


4 4 
= ra 12 
O@) = Te = Spee? + Ope + pies oe 
and 
px 
= 13 
Qs(x) 1 — qu — Tpqz? + 6pq?a? + 15p?q?x4 He) 


_— 10p*q?a5 = 10p*q?x® -+- Apeqta’ +- piqixs 
The average time between misframes can be determined from the 
above by differentiation. Thus” 


Lae = Qy’(1) (14) 
* See Ref. 4. 
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when p and q are for the second digit of the Gray code. The results for 
various counter sizes and for system overload at 4 and 3 times rms are 
shown in Fig. 9. It is seen that with a counter of size 16 and worst-case 
loading the misframe interval is still sufficiently long, 1000 hours at a 
6-me sampling rate. 

One disadvantage of using the reversible counter is the slow reframing 
process. When the receiver is out of frame the counter can be assumed 
to receive 1’s and 0’s with equal probability. Using formulas developed 
above for N = 16 but substituting 0.5 for p and q, one obtains an 
average of 272 words to reach full count. For a 9-digit PCM system 
sampled at 6 mc, this amounts to 360 usec for the average framing time. 
To shorten the framing time a dual-mode scheme applied frequently in 
conventional framing circuits can be used. The scheme is described in 
more detail below. 

The framing circuit is designed to have two modes of operation. In 
the in-frame mode, the counter size is set at 16 for maximum reliability; 
once the out-of-frame signal is received the counter size is reduced to 8 
to secure fast framing. The logic is depicted in Fig. 10. 

The flip-flop determines the mode of operation. When in frame, the 
flip-flop is reset and the counter must reach count 16 excess 0’s over 1’s 
of the second-digit Gray code. When the system goes out of frame, the 
probabilities of 1’s and 0’s are equal, and an output from the N = 16 
lead of the binary counter chain sets the flip-flop to the out-of-frame 
mode. In this mode the output from the N = 8 lead of the binary chain 
is used. At the same time a timer is turned on to reset the flip-flop after 
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Fig. 9 — Reliability of the reversible counter. 
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Fig. 10 — Dual-mode reversible counter framer. 


a certain elapsed time. This elapsed time is selected such that it is longer 
than the maximum time required to get an output from the N = 8 
lead when the system is searching but shorter than the minimum time 
required to get an output from this same stage when the correct frame 
is found. ‘“Maximum” and “minimum” are used here in a probabilistic 
sense to be defined later. Thus during recovery the timer is reset before 
it reaches the preset time, thereby preventing the flip-flop from resetting 
back to the in-frame mode. When the system cycles back into frame, 
the timer will return the system to the in-frame mode. Each time a 
signal appears at the counter output, the framing counter is inhibited 
one time slot in order to examine the next bit position; the reversible 
counter is also reset automatically to zero. With the proper preset time, 
the system is almost always prevented from cycling past the true in- 
frame position. 

To estimate the framing time for this scheme, we again use the exam- 
ple of a 9-digit PCM system sampled at 6 X 10° per second. For the 
worst case of searching through all 9 digits the average framing time 
is given by 


Tr = Qy'(1) + 7Q3'(1). (15) 


The effect of incorrect decisions by the timer which cause recycling is 
ignored here. The first term corresponds to detection and the second 
term corresponds to the search through the next 7 positions. The time 
spent in verifying that the last position is the correct one is not included, 
because the system will already be in frame. The above equation is 
evaluated for p = q¢ = 4% and yields the worst-case average framing 
time of 130 usec. This estimate is again conservative, since 0’s occur with 
probability greater than 14 in some digits of the Gray code. 
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The exact distribution of the framing time for the worst case may be 
determined by expanding Qy6(x)Qs'(x) in a power series. Again, this is 
difficult to do accurately. To get around this problem an approximation 
to the inverse transform of Qs(x) is determined by noting that the decay 
in the tail of the distribution is dependent mainly on the singularity of 
Qs(a) closest to the unit circle (1.01728 in this case). On this basis the 
inverse transform is approximately 


1.6986 X 10° 


“oinsys 40 frF 


qs (k) = 


IIV 
ore) 


and 
gs(k) = 0 05k <8 
where 1.6986 X 10°” is selected so that 


> as(k) = 1. 
k=8 
Using the result and returning to the x domain 


1.6986 X 10°72° 


(1 - igis) (16) 


We now make the further approximation of replacing Qis(2) by Qs(x) 
in the product mentioned above. We can therefore deal with the simple 
result given by (16) raised to the 8th power. On this basis a somewhat 
optimistic expression for the distribution of the framing time can be 
readily obtained: 

(1.6986 X 10°)*(n — 57)! 


_ (1.6986 _X 107) (nm — 57)! > 64 
P(r) = “To SAI oITgy COT BO (17) 


p(n) =0 OSn< 64. 


Qs(x) & 


The upper tail of p(n) is shown in Fig. 11. 

Taking the 10° point as the confidence limit and multiplying by the 
sampling period, we get 200 usec as the maximum framing time. Since 
the framing process is dominated by the Qs’(x) term, the error intro- 
duced by the substitution of Qs(a) for the Qie(a) term should not be 
significant. 

Finally, we note that an optimum time must be chosen for the timer 
in Fig. 10 to reset the flip-flop back to the in-frame mode. Selection of 
this time is based on the distributions of waiting times for an output 
from the N = 8 lead of the counter, first under the out-of-frame condi- 
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Fig. 11 — Distribution of framing time for reversible counter. 


tion and second under the in-frame condition. Summing the first two 
columns of Table I we obtain 0.13 as the probability of a 0 for the second 
digit when the rms input is at 14 of the system overload. For the other 
digits a probability of 0.5 is assumed. Expanding Qs(a) in a power series 
when p = 0.5 and when p = 0.13 yields the desired result. This is plotted 
in Fig. 12. If the time is chosen to be 560 frames, the framing detector 
will be in the wrong operating mode only 0.01 per cent of the time, 
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Fig. 12 — Selection of optimum time for the timer. 
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which means that the framer will seldom cycle past the true frame 
position during the framing process. 

We note for future reference that the distribution of the waiting time 
in rig. 12 for p = 0.5 isa straight line on semilog paper, which indicates 
that it has an exponential tail. 


VI. MEASURED FRAMING PERFORMANCE FOR THE RC cIRcUIT 


We introduce this section by defining the problem. Illustrated in Fig. 5 
is a typical input to the RC circuit, a random pulse train 


a(t) = > ang(t — nT) (18) 


where ap is a sequence of independent random variables assuming values 
1 or 0 with probabilities p and (1 — p), and g(t) is a rectangular pulse 
of height # and width w. When this pulse train is applied to the circuit 
of Fig. 5, the capacitor will charge when a pulse is present and discharge 
otherwise. The charging time constant is 


Rik, 


ee i() 19 
0" Ri + Ba om 
and the discharge time constant is 

Ta >= RC. (20) 
It is also convenient to refer to the attenuation constant 

Re Te 

K = ——-— =1-—. 21 
Ri + Re Td (21) 


We are interested in the transient response of the circuit y(t), particu- 
larly at times t = w,t = T + w,---,t = MT + w because they are 
the local maxima. We can proceed step by step: 


y(w) = aKE|1 — exp (—w/r.)| (22) 
y(T) = aoKE{L — exp (—w/r.)] exp [—(T — w)/tal3_ (23) 


at t = JT’ + w, the charge due to a, is added, the charge due to apo de- 
cays further with a time constant of either 7. or ta depending on the 
value of ay 


y(T + w) = aKE{l — exp (—w/r.)] 
+ {aK E[1 — exp (—w/r,.)] exp [—(T — w)ra]} (24) 
[a, exp (—w/r-) + (1 — a1) exp (—w/ra)]; 
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in general 
y(MT + w) = KE{1 — exp (—w/7.)] 3 On 
exp [—(M — n)(T — w)/ral (25) 


IT [am exp (—w/t-) + (1 — Gm) exp (—w/Ta)]. 


The framing performance of this circuit is related to the probability 
distribution of the first time that the output of the circuit exceeds a 
certain threshold. It is the distribution of the smallest AZ such that 


y(MT + w) > threshold. (26) 


To find the distribution analytically from (25) appears difficult. 
Some simplification can be obtained by assuming that the widths of the 
pulses are small or by assuming that the charge and discharge time 
constants are the same. Under either of these conditions the product in 
(25) disappears and the output is essentially of the form 


M 
a= ae 0< 6 <1, (27) 
n=0 


The behavior of the random variable z when M — o has received some 
attention,’ but the distribution of the first passage time of z with respect 
to some threshold is still difficult to obtain. 

Here the experimental approach is taken; the circuit used is depicted 
in Fig. 13. The input is derived from an analog-to-digital converter with 
a Gaussian signal as input. The output of this converter is in Gray code. 
By adjusting the level of the input signal and by selecting the various 
digits of the Gray code, a pulse train with any desired pulse density may 
be obtained. The digital timer measures the waiting time; it is started 
at the closing of the input switch and stopped by the threshold circuit. 
The threshold circuit also opens the input switch and signals the re- 
corder to write the timer output on tape. A delay circuit resets the digital 
timer and initiates the next cycle of measurement after the RC circuit 
has returned to the rest condition. Each timing and recording operation 
takes about one msec; about a million measurements were made and 
recorded in a matter of minutes. A simple computer program reads the 
data and compiles the cumulative distribution of these data as well as the 
mean and standard deviation. 

Some qualitative results concerning the effects of the various parame- 
ters will be given below. First, for all of the combinations of the parame- 
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Fig. 13 — Measuring distribution of first passage time. 
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ters chosen, the measured distributions tend to have an exponential tail; 
they plot as straight lines on semilog paper (see, for example, Fig. 15 
below). An intuitive argument can be given for this result. If we suppose 
that the threshold is set very low compared to the average output of the 
circuit, at voltages below this threshold the circuit acts more like an 
integrator than an RC circuit because it charges almost linearly and 
discharges very little between pulses. The distribution should therefore 
be similar to the distribution of the waiting times for the nth success in 
a sequence of Bernoulli trials, which has an exponential tail. Now we 
suppose that the threshold is set high compared to the average output 
of the circuit. Near this threshold, the circuit decays rapidly between 
pulses, so that a succession of many pulses in a row is necessary to drive 
the circuit over the threshold. The problem is now similar to the first 
occurrence of n consecutive successes In a sequence of Bernoulli trials, 
which again has an exponential tail. Finally, we can suppose that the 
threshold is set about equal to the average output of the circuit when the 
probability of a pulse at the input is 0.5. Near this threshold the decay 
due to an absence of a pulse is about equal to the charge contributed by 
a presence of an input pulse. The circuit therefore behaves much like a 
reversible counter in this region. In the previous section this has been 
shown to have an exponential tail. All of these arguments are of course 
approximate, but, lacking a complete theory, they serve to provide some 
insight. Knowledge that the distribution of the waiting time has an 
exponential tail enables us to use the techniques developed for the re- 
versible counter to estimate the framing time distribution of this circuit. 
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The second qualitative result is that the measured distributions of the 
first passage time for the various circuits are very much the same as long 
as their composite time constants and relative threshold settings are the 
same. By “composite time constant” is meant the time required for the 
output to reach (1 — e) of the maximum output when all pulses are 
present at the input. By “relative threshold” is meant the threshold as 
a fraction of the aforementioned maximum output. The different situa- 
ations are illustrated in Fig. 14. 

The composite time constant and the maximum output can ke com- 
puted from (25), setting all a,’s to 1. 


y(MT + w) 
= KE[l — exp (—w/-.)] > exp — | cw — n) « es Hy. a 





Td Te 


Letting M approach infinity we obtain the maximum output 
ie a eS Os Se ao cy 
me) 
Td Te 


The expression inside the summation in (28) can be rewritten as 


exp — | rear — n) C Sd. “)| (30) 


Td Tec 








where w’ = T'/w, the duty cycle of the pulses. From this we can see that 
the composite time constant is 


( =v wy (31) 


Td Te 





The third qualitative result is the following. For circuits and threshold 
settings such that with equal probability of pulses and spaces at the 
input the distributions of the first passage time are the same, the average 
first passage time for low probability of input pulses is longer when the 
relative threshold is higher. Relative threshold is defined as above. 
This result can be explained by using arguments similar to the first re- 
sult. At low threshold settings, the circuit acts as an accumulator so 
that the average first passage time is inversely proportional to the aver- 
age pulse density. On the other hand, for high threshold settings, the 
first passage time depends on the occurrence of many consecutive pulses; 
the probability of this occurrence decreases exponentially with the 
average pulse density. This result is directly applicable to the framing 
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Fig. 14 — Two situations depicting different parameter settings but with sub- 
stantially the same distribution of the waiting time to first passage of the thresh- 
old voltage. 


problem. For the RC circuit, the dual-mode operation controlled by a 
timer used for the reversible counter is not necessary. With appropriate 
choice of circuit parameters and threshold, one can achieve fast framing 
and low misframe rate at the same time. To what extent the threshold 
can be adjusted to improve framing performance depends on the stability 
of the circuit. When the threshold is set near the level corresponding to 
all pulses present, a small drift in any of the parameters will cause a 
large change in reliability. 

The framing performance of a typical RC circuit will be given here. 
Again we assume a 9-digit PCM system with 6-mce sampling rate. The 
parameters are as follows: 


pulse width = 50 per cent duty cycle 
charging time constant = 0.44 usec 
discharge time constant = 1.2 usec 
composite time constant = 0.64 usec. 


With the probability of a pulse set at 14, the variation of the distribu- 
tion of the waiting time with threshold setting is illustrated in Fig. 15. 
The variation of the misframe interval and average framing time with 
threshold setting is illustrated in Fig. 16. If the threshold is chosen such 
that the misframe interval is 105 seconds (about one day), the average 
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Fig. 15 — Distribution of the first passage time. 
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Fig. 16 — Performance variation of the RC framer with threshold settings. 
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first passage time is about 20 usec. This yields an average framing time 
of 160 usec if 8 positions are to be cycled through. Using the results of 
the reversible counter as a guide, the maximum framing time with 99.9 
per cent confidence is about 250 usec. 


VII. SUMMARY 


This paper has considered the possibility of framing a PCM signal 
by utilizing the statistics of the code digits. Three schemes for testing 
digit statistics have been proposed and their performances analyzed or 
measured. Statistical framing is shown to be feasible and effective 
whenever the signal statistics satisfies certain weak conditions. 

The authors wish to acknowledge the help of H. H. Henning and 
I’. P. Rusin for design and construction of equipment used and assistance 
given in obtaining the experimental results. The encouragement of M. 
R. Aaron is also appreciated. 
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Prolate Spheroidal Wave Functions, 
Fourier Analysis and Uncertainty 
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Dimensions; Generalized 
Prolate Spheroidal 


Functions 
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In two earlier papers* in this series, the extent to which a square-integra- 
ble function and its Fourier transform can be simultaneously concentrated 
in their respective domains was considered in detail. The present paper 
generalizes much of that work to functions of many variables. 

In treating the case of functions of two variables whose Fourier transforms 
vanish outside a circle in the two-dimensional frequency plane, we are led 
to consider the integral equation 


yo(a) = | Twloxy) Vexy elu)ay. () 


It is shown that the solutions are also the bounded eigenfunctions of the 
differential equation 


da’ d 1 — N’ - 
(1 = 2) SE — eB 4 (x oe +! 2 Je=0, (iz) 





a generalization of the equation for the prolate spheroidal wave functions. 
The functions ¢ (called ‘‘generalized prolate spheroidal functions’’) and the 
eigenvalues of both (2) and (it) are studied in detail here, and both analytic 
and numerical results are presented. 

Other results include a general perturbation scheme for differential equa- 
tions and the reduction to two dimensions of the case of functions of D > 2 
variables restricted in frequency to the D sphere. 


* See Refs. 1 and 2. 
3009 
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I. INTRODUCTION 


In two earlier papers’ in this series, the extent to which a square-in- 
tegrable function and its Fourier transform can be simultaneously con- 
centrated was considered in detail. In that analysis, the eigenfunctions 
and eigenvalues of the finite Fourier transform played a key role. These 
functions, defined for | « | S 1 by the integral equation 

1 


awi(x) = [ ey (y)ay, (1) 
can be continued analytically throughout the complex plane. They 
possess a number of special properties that make them most useful for 
the study of bandlimited functions. The functions are complete in the 
class of bandlimited functions; they are orthogonal in (—1,1) and also 
in (— 0,0); the y; are also the eigenfunctions of the integral equation 


_ fi sin e(x — y) 
AW(x) = [ aC eae v(y)dy o 


_ & 2 
A= 7 lel 


whose kernel is positive definite; y, , the eigenfunction of (2) belonging 
to the largest eigenvalue, is in an appropriate sense most concentrated 
among bandlimited functions of given energy. These and other proper- 
ties are discussed in detail in Refs. 1 and 2. Some familiarity with these 
papers will be assumed in the following. 

In the present paper we consider certain aspects of the generalization 
of this earlier work to functions of many variables. Many of the structural 
results of Refs. 1 and 2 (as was pointed out there) depend only on the 
fact that the operator defined by the right of (2) is completely con- 
tinuous and positive definite. The generalizations to D dimensions are 
perfectly straightforward: we comment briefly on some of them in Sec- 
tion II, but do not belabor them. Our main concern here is with details 
of the explicit solution of some of the integral equations that are multi- 
dimensional generalizations of (1). An unexpected dividend of this work 
is that one of these equations is of interest in the theory of masers. 

In Section III, we point out some general features of the integral equa- 
tions to be considered. Section IV treats the case of functions of two 
variables whose Fourier transforms vanish outside a circle in the two- 
dimensional frequency plane. The analog of (1) is shown to be the in- 
tegral equation 
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1 
yo(r) = [ Jy (err’)Verr'e(r’)dr’, O<sr 
0 


IIA 
—_ 


(3) 


This integral equation also describes the modes in a maser interferometer 
with confocal spherical mirrors of circular cross section (Ref. 3, p. 488). 
The eigenfunctions of (3) are shown to be the bounded solutions of 


de d 1 — N” 
(r= 2) $8 — ae + (x — ot 3 3 Je=o (4) 





that vanish at x = 0. 

We call the solutions of (4) generalized prolate spheroidal functions. 
Section V is devoted to their study: 5.1 treats the case of small c; 5.2 
and 5.3 treat various asymptotic cases.* 

In Section VI, the results of Section V are used to discuss the eigen- 
values of (3). Various asymptotic forms for these quantities are derived. 

Section VII treats the case of functions of D > 2 variables whose 
Fourier transforms vanish outside a sphere in the D-dimensional fre- 
quency space. It is shown that this more general problem can be reduced 
completely to the case already treated in Sections IV, V and VI. 

Finally, in Section VIII we present some numerical detail about some 
of the eigenfunctions and eigenvalues encountered. Applications of these 
results will be presented elsewhere.” ” 


HW. GENERALIZATIONS OF EARLIER WORK 


We denote points in Euclidean space of D dimensions, Ep , by vectors, 
X = (%1,%,++:,Xp). A square-integrable function of D variables, 
f(x), 1s said to be R-limited if it can be represented as a Fourier integral 


f(x) = (20)? | exp (ix-y) Py) dy (5) 


over the bounded region R. Here x-y = >. 2,y; is the usual scalar 
product and we write dy for [| dy, . If f is of total energy A, then by 
Parseval’s theorem 


A= [| s(x) Fae = (2)? [| PCy) Fay, 


whereas the energy of f in the bounded region S is 


* Some of the results of Sections IV and V have been developed independently 
by J. C. Huertley,!! who was led to consider (3) from its laser applications. 
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[\4@ Fae 
= [ axlony? | dx exp (iz-x) F(x) / dy exp (—iz-y)F(y) 
8 R R 


= (2r)~? [ ax f ay Ks(x — y)F(x)F(y) 


where 


K(x — y) = (2)? [ exp liz (x — y)]ae (6) 


and an overbar denotes complex conjugate. The largest fraction of energy 
that an R-limited function can have in the region S is therefore the 
maximum value of the fraction 


[ax fay Ks — yyrC@Py) / f 1G) Fay 


taken over all functions F square-integrable through Rk. This maximum 
is the largest eigenvalue of the integral equation 


W(x) = [ Ks(x — y)¥(y)dy, x R, (7) 


which is the analog of (2). 
The kernel (6) of (7) is positive definite, since 


[ de [ dy Ks(x — y)f(x)F(y) 


= (27)? [we [ aw exp (iz-x)f(x) >0 





whenever 


. f(x) Pde > 0. 


By well-known theorems (see Ref. 4, Chap. 6), the eigenvalues of (7) 
are real and positive and the eigenfunctions, orthogonal on R, are com- 
plete in the class of functions square-integrable in R. A complete dis- 
cussion of the simultaneous concentration of square-integrable functions 
in Hp and their Fourier transforms can be given in terms of the largest 
eigenvalue of (7) as in Ref. 2, Theorem 2. 

The right member of (7) can be used to extend the domain of defini- 
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tion of y. We define 
I 
v(x) => Ks — yvydy, x € Bo. 


Then for two different eigenfunctions of (7) 


| veda 
ee ae i: dy Wilx)J;(y) is dz Ke(z — x)Ks(z ~ y). 


To evaluate the innermost integral here, we observe from (6) that Ks is 
given as a Fourier transform, so that from Parseval’s theorem, 


[ Kg(z = x) Kg(z = y)dz 


L(y? [ au exp [—iu- (x — y)] = Ke(x — y). 
8 
One then finds 


[,, dswde = fade dela) f dy Bale — ybs(y) 


= =f aeaGNE GG). 


The orthogonality of the y; over # thus implies orthogonality over Ep 
as well. 

Other results of the one-dimensional case extend as easily to D di- 
mensions, but we do not dwell further here on this general structure. 


Ill, SYMMETRY CONSIDERATIONS 


In what follows, we shall be concerned with the explicit solution of a 
number of instances of (7). Considerable simplification occurs when the 
region # is symmetric, i.e., when x € R implies —x € R, and when Sisa 
scaled version of R. We write S = cR where x € cR if and only if x/e € R 
with c a positive constant. We restrict our attention henceforth to this 
case. 

Somewhat simpler than (7) is the integral equation 


ay (x) = [ exp (icx-y)y(y)dy, xcR (8) 
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which is a natural generalization of (1). We shall show in this section 
that solution of this equation is completely equivalent to solution of (7) 
when the symmetries just discussed maintain. We shall accordingly here- 
after take (8) as our equation of fundamental concern. 

From the symmetry of R, it readily follows that if y(x) is a solution of 
(8), so also is ¥(—x), so that both y.(x) = W(x) + wW(—x) and 
Wo(x) = ¥(x) — ¥(—x) are solutions as well. The eigenfunctions of (8) 
can be chosen to be either even or odd functions of x. 

The complex conjugate of (8) is 


a(x) = | exp (—iex-y)Py)dy, x ER. (9) 


Multiply (8) by Y(x) and integrate over R. Multiply (9) by ¥(x) and 
integrate over 2. Combining these equations, one finds on using the sym- 
metry of & that 


(a a) | yx) P (x) dw 


= fax [ dy exp (iex-y)F(x)IV(y) & v(—y)]. 


If then y is even, by choosing the negative sign in this equation, one 
obtains a — & = 0, whereas if y is odd, by choosing the plus sign, one 
finds a + & = 0. The eigenvalues of (8) associated with even eigenfunctions 
are real: the eigenvalues of (8) associated with odd eigenfunctions are pure 
imaginary. It follows then that (8) is equivalent to the pair of equations 


Ba.(x) = is cos cx-ye(y)dy (10) 


Gy = y sin ex yey) dy (11) 


in which 8, and 8, are real. These equations have real symmetric kernels 
and we can fall back on the extensive theory in the literature treating 
such equations. We observe that the eigenfunctions of (10) must be 
even and that 8. = 0 cannot be an eigenvalue of this equation, for by 
Fourier theory the only even square-integrable function in R for which 


[ cos cx:yW¥(y)dy = 0, xER 
R 


is¥(y) = 0. It follows then from the theorem on page 234 of Ref. 4 that 
the eigenfunctions of (10) are complete in the class of even functions 
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square-integrable in #. A similar argument shows that the solutions of 
(11) are complete in the class of odd functions square-integrable in R. 
The solutions of (10) can be chosen real and orthogonal in Ff, as can the 
solutions of (11). Solutions of (10) are automatically orthogonal to solu- 
tions of (11) by symmetry. 

We have now shown that the solutions of (8) are complete in the class 
of functions square-integrable in FR. The eigenfunctions can be chosen 
real, orthogonal, and either even (in which case the eigenvalue a is real) 
or odd (in which case a is pure imaginary). We henceforth assume the 
yw so chosen. 

By iterating (8), one finds that the y also satisfy 


W(x) = i: K.(x — y)b(y)dy (12) 
d= (<) | | (13) 


K.(x) = (Z) i exp (icz-x) dz = (2r)~” ie e* dz (14) 


with 


which is (7) in slightly altered notation and is the D-dimensional ana- 
log of (2). Since the solutions y of (8) are complete, it follows that they 
are also a complete set of solutions of (12). As was asserted, to solve (12), 
it suffices to solve (8). 

The eigenfunctions of (8) can be extended by demanding that equa- 
tion to hold for all x € Hy. It is then easy to show that the extended y 
are orthogonal in Hy and that they are complete in the class of c-limited 
functions. 


IV. THE CASE D = 2, R A CIRCLE 


We now treat in detail the equation 
av (as a2) = fey ye) dyn (15) 
R 


where # is the unit circle y? + Yo" < 1. Change to polar coordinates 
gives 


1 27 
av(re) = ; dr’ r’ do’ e"" °° er (r’,’) 
(16) 


1 Qr 
mem’ i dr’ r'Im(crr’) / do’ *"" W(r' 6’) 
0 0 


io) 
—e6 
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on making the usual Bessel function expansion. Here (7,0) is exhibited 
as a J*ourier series in @. A simple argument then gives for the eigenfunc- 
tions of (15) and their corresponding eigenvalues 


Wo.n(7,0) = Tea), Qon = 2mBo.n 
N@ : 
Yrn(t0) = Revnlt) or yg xn = 2mi%Bvw (17) 
NE ee BD ox T= Qi LD ha 
where 
1 
Byakva(’) = | Jy(err’)Ry.n(r' dr’ dr’, 0OsrSl, 
0 (18) 
n, N = 0,1,2,.... 


All the eigenvalues of (15), except possibly the a,,, have at least a two- 
fold degeneracy inherited from the symmetry of the circle. 
Our task now is to study the integral equation 


IIA 
— 


1 
BR(r) = I Jy(err’)R(r’)r' dr’, OSr 


It is convenient to make the substitutions 


y= VB, o(r) = VrR(r) (19) 


to obtain the symmetric equation 
1 
ye(r) = i Jy(err’) Verr’g (r’) dr’, OSrs1. — (20) 
0 


Note that ¢(0) = 0. We shall show that the eigenfunctions gy,,(r) of 
(20) can be obtained as the solution of a Sturm-Liouville differential 
equation. 


Let 


Ky(«) = Jy(x)Vx (21) 
and let the operator M be defined by 


[Myl(x) = i Ky(cxy) p(y) dy. 


Denote by L, the differential operator 


_@ ay a oN — cx') 
L, = = (1 “$4 (i cx). 
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Consider now 


[M Ly] (x) I Ky(cry) BR (1 —y’) £ 


+ (? _ — ev) | v(y)dy 
[Kw(exy) (1 — y’)W'(y) — ex(1 — y’) (22) 


‘Ky’ (cry) W(y)y-0 + [ v(y) Ex — y’) Ky’ (czy) 


: ‘) Ky(eny) | dy 


where the right member is obtained by integration by parts. Here primes 
denote differentiation of the function in question with respect to its 
argument. The integrated expression vanishes if y(0) = 0, since from 
(21), Ky(0) = 0. Also from (21) and the differential equation satisfied 
by Bessel functions, one has the acai: 








ee 
— 2cxyK’ (cay) + € > 


Ky" (ery) = “G44 aye t=) Ky (ery). (23) 


Sibstibite this expression in (22) to yield 


[MLy](«) = [ ¥(y)l ~ cay! (cay) 


+2 -—-N4+4eey — ea? — cy’) K(cxy)]dy, (24) 


¥(0) 0. 
On the other hand, by direct calculation and use of (23), one has 


[LMy] (x) 


1 
Le / Ky (cay) p(y) dy 
0 


I 


I v(y) la — x )ey’ Ky’ (cxy) — 2acyKy’ (cry) 


1—N 22 | 
+ ees) Ky(cay) | dy 





1 
= I v(y) )| - QeeyKy’ (cay) + {=a = a yey? 


—N*\,i1-N 2, | 
(1 + aap oN) + = ea Ky(cxy) | dy 
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1 
~ I ¥(y)[—2cayKw' (cay) + (2 — N? + e’x’y? 


— ex — cy’) Ky (cay) ]dy 
= [MLy](z) 
on comparison with (24). 

Let C be the class of functions square-integrable in (0,1) and twice 
differentiable there that vanish at the origin. Operating on functions in 
C, the operators M and L commute. It follows that solutions of 

Lip(x) = —xy(x) 


in C are also solutions of (20). Consequently, we next turn our attention 


to the differential equation. 
2 de 7 dp 1 _ Nn” 
(1 -— x) a3 24 oe + ( 


a2 





— ee + x) g=0. (25) 


V. GENERALIZED PROLATE SPHEROIDAL FUNCTIONS 


When N = 3 in (25), this equation reduces to the equation for pro- 
late spheroidal functions of order zero. We shall refer to bounded solu- 
tions of (25) for arbitrary values of N as generalized prolate spheroidal 
functions. These functions are similar in many respects to prolate sphe- 
roidal functions, as the development that follows shows. Bounded solu- 
tions of (25) exist only for discrete values of x, say yv,n,m = 0,1, 2,... 
which we label so that yvy,0 S xv. S xw.2 S ... . We denote the corre- 
sponding eigenfunctions by gy,n(x). 


5.1 Expansions in Powers of c 


Consider first the case when c = 0. Substitution of the series 
p= D ae? 


into (25) shows that we must havea = 3 + N.If N + 0, the negative 
sign leads to solutions having a singularity at x = 0. If N = 0, a second 
solution can be found, but it has a logarithmic singularity at x = 0. 
We must have therefore 


a=i+N. 
The coefficients are given by the recurrence 


as aid (a+ 2j)(a+ 27 +1) -— x 
eS a Og la eg a 1) pe Ne 
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For large j, a;4:/a; — 1, so unless the series terminates, this solution 


becomes unbounded as x — 1. Choosing x to terminate the series at 


2 
2°?" we have 


X = xwn(0) = (N + 2n + 3) (NW 4+ 2n + 3) (26) 


for the eigenvalues of (25) when c = 0. The series solution now becomes 
(when a, is set equal to unity) * 


g = Ty,(t) = oO Ry, (2) 
‘ (27) 
Ry,»A(t) = F(-nvsn+N+1;N4+1;2) 


where 


4 
Feben. =14 227064 DOr De 


¢ 1! c(e + 1) at 


is the usual Gaussian hypergeometric function. The polynomial Ry »(2) is 
readily expressed in terms of Jacobi polynomials P,‘*'” (x) (Ref. 5, 
Chap. IV). Adopting the notation of Szegé, we have 


Ries G ca ie P,”(1 — 22%, (28) 


From (27), (28) and the known properties of the Jacobi polynomials, 
one finds 


Tyn(l) = (—1)" & - ae (29) 


n 


Onn’ 


a Tvn(%) Twn (adr = 
° 2(2n + N + 1) ( 


n+ ¥) (30) 
nN 
Qn +N + 1)?(Qn + N)Ry ns 

= (2n+N +1) [(2n +N) (2n+N +4 2) (1 — 22’) 


+ N’|Ryn— 2n'(2n + N + 2)Rw nat 
(Qn + N)x(1 — 2°) © Rn (2) 
dx 
= n{(2n + N)(1 — 2a") — N]Ryn(x) — 2n*Ryn-a(2) 


«Ty n(x) = Yvan Tv nsi(2) + ynin Uw n(2) + Vee Ty eae) (31) 


* It has been called to our attention that our T'y,n(x) are closely related to the 
Zernike polynomials. These latter arise in the diffraction theory of aberrations.” 
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eee (n+ N+ 1)? 
ae Qn +N + 1)@n +N +2) 
N? 
veo! = 3(1+ Gee ENED) < 
=| n? 
YN,n as 


Qn FN)Qn +N +1) 

|\Tvn(t)|S1 for OSa¢S 1. 
The function Ty,.(2) has n zeros in (0,1]. We define Ty n(x) = 0 if 
n<0. . 


Returning now to (25) for arbitrary values of c, we attempt a power 
series solution in c” by writing 


IG SHS Taye 3 OW nt) (33) 
pit ae key ape Oe  eaj(Nin), (34) 


where the Q’s and a’s are independent of c. When this latter quantity is 
zero, this solution reduces to that already found. As is shown in Appendix 
A, the Q’s and a’s can be determined recursively in an elementary man- 
ner. We have 


Q;(N,n,2) 


ll 


SAM MTs tel) (35) 
with 

a;(Nn) = DAL Wine j=1,2,... (86) 
Lxw, atm(0) — xwn(0)]An?(N,n) 


ax(Nn)Am aN, n) — oo Aye az ‘(N, n)YnN, See , (37) 


I 
i 


m= —-j,-jtl,...,9; 7 =1,2,.... 


Here A,?(N,n) is defined to be zero if |k| > j, ork < —nork = 
and j ~ 0. In addition we have A»’(N,n) = 0,m #0, A.(N,n) = 
ao(N,n) = 0. For use in (36) and (87), obey s of (82) must be sae 
so that for n < 0, YW, = = Yvan =YvnHn = 0 

To terms of order c’ the eigenfunctions and eigenvalues of (25) are 
explicitly 
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I 


would = (243742) (00-407 42) 


(38) 
N 
(2n + N) (2n + N + 2) 
WT yn—1(2) 
4(2n + N)?(2n + N + 1) 


(n +N + 1)°Ty.n4i (2x) , ; 
7 eo a TH) e+ Oe). 


In view of (35), the series (33) can be formally regrouped to give 


+3 (1+ \é + Oe) 
feet. ( 
(39) 


ie] 


gun(t) = Das (ey Ts ia). (40) 


0 
Substitution in (25) yields the three-term recurrence 
yee i 
+ [evn + (25 +N +4) (Q94+N 49) — x1ai"" (41) 
+ yy dj” = 0. 


This recurrence can be used to determine the d;*"" and the eigenvalues 
in a manner quite parallel to that used in the study of prolate spheroidal 
wave functions. The method of Bouwkamp’ can be adopted and used 
advantageously for the computation of the d;”’" and the eigenvalues for 
values of c too large to permit effective use of (33) and (34). The d’s 
can, of course, be expressed in terms of the A’s of (35). One has 


ie] 


dnaj '"(c) = 2 A; (N,n)e", jg=—n,—nt+l1,.... (42) 
=i 


The series solutions (40) or (33), (35) for the generalized prolate 
spheroidal function are, of course, valid only for 0 S$ x S 1. To obtain 
a series valid for x > 1, we use (20) and the fact (established in Appendix 
B) that 


1 —1 
; In(cay) Very Trn(y)dy = i ”) Tusmilet) (43) 
0 n V/ cx 
The solution (40) then extends for all x by the series 


LoS own J ws2is (Cr) 


enn(t) = Aaa 2, dj C + i) Jes (44) 
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which is obtained by inserting (40) in the right of (20) and integrating 
term by term. 


The eigenvalue yy,» can be expressed in terms of the d;*’”. Divide 
both sides of the equation 


YwinPnin(Z) = | J lex’) —/cax’ enn (a) de’ (45) 


by «”* and take limits as 2 — 0. From (27) and (40), we see that the 
left member of (45) becomes 


YNyn Ds dj'"Ryn(O) = Yvn > dj". 
7 ? 
Since oe ~ (2/2)"+/z/T(N + 1), the right of (45) becomes 


[ (eal) ex ald! 


PUEDE 
= vt Nin ; IN+4 f : 
=F ye fe tye ae 
wh 
"FW FD Ear [ Peale!) sla! a 
ay hod mm 


~ Tw +2298 

where we have used successively (40), (27) and (30). The combined 
result is 

CN+ig Nin 


Yn = SPARC 4 2) ay (46) 
j=0 


The integral equation (45) is also useful for obtaining the asymptotic 
behavior of gyn(2) as 7 — ©. We have 


Nth 


rratrn() => fo du Fu(u) 2) (47) 


on letting crv’ = u. Now na wai)’ = (u*"Jy), so that (47) can be 
integrated by parts to yield 


YN nPNn (x) = = | ui (u ) ee PN, n(u/ex) |” 


NE 





0 


d m 
— [ du why wai () = reac aus ol) | 
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i 
= — Vcr ovn(l)Jnuilex) — R. 
Cx 


For large x, this becomes 


2 cos [ex — (N + 1)(a/2) — (w/4)] 


Cx 
+ O (5) . 
x 


This of course is consistent with (44). If now we define gy n*(x) to be a 
generalized prolate spheroidal function normalized so that for large x 


orate) ween WEN) — G/M ag 


Yun = 4/2 en0X() (50) 


a relation that will be useful to us later. 


YN,nPN n (x) = pnyn(1) 
(48) 


(48) gives us 


5.2 Asymptotics for Fixed n and Large c 


The behavior of generalized prolate spheroidal functions for large c 
can be determined by methods quite parallel to those used in Ref. 7 in 
discussing the prolate spheroidal functions. Five different asymptotic 
forms for gyn(x) are found, depending on the x range under considera- 
tion. These are properly joined to furnish a solution for all x. For most of 
these regions, we content ourselves here with writing only the leading 
term of the asymptotic development. 

In (25) we make the substitution t = z+/c. There results 





Ly — (1/c)M¢ + (x/c)e = 0 (51) 
where the operators are given by 
_d ,t+-N  , 
L= iE + Z t 
(52) 
SY eee es 
Maa lays 


Now the equation 


LU + )\U = 0 
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has solutions 
i= Un.n(t) = ea EOP) 
An = 4n + 2N + 2, | ea | Pd 


(see Ref. 5, p. 99) where L,,‘” (x) is the Laguerre polynomial of degree n 
in Szeg6’s notation. The function Uy,,(t) has n zeros in (0,0). This 
suggests attempting solution of (51) for large c by the series 


(53) 


ouat = Una ll) + 2 /e) SiN) (54) 
xwa(o)/e = dn + 2N +24 Yo (1/e')bs(WNn). (55) 


We now note that 
MU n(t) = twin Unwnge + pwn Unin + Hx Un» (56) 
where 
vn = (n+ 1) (n + 2) 
pn = —[(2n +1) (n+N 44) 4+ 4] (57) 
tvn = (n+N)(n+N-—1), 
a fact which can be readily derived from (52), (53) and the properties 
of Laguerre polynomials. The perturbation scheme of Appendix A applies 
therefore, and we find at once that 
S;(N,n,t) = 3 Be Win) Un naae(l (58) 


where the B’s and b’s are given by the recurrence 


1 
b;(N,n) = 2 Bal (Nn) bv nats ot eee 


8mB,,7(N jn) = 2d, bi. (Nn) Bm? *(N,n) 
k= 


1 
=D Bain WV in)nwmnst(m =) O°) 
Me ey eg oe Lele na 9S. gd Aye es 
with the convention B,’(N,n) = 0 if |k| > j, ork < —nork = 0 
andj ~ 0. We take B,,°(N,n) = 0, m # 0, Bo'(N,n) = 1, bo(N,n) = 0. 
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In this manner we obtain explicitly 
xwn(c) = (4n + 2N + 2)e — [(Qn+1)n+N+44) +3 


(N + 2n + 1)[2n? + 2n(N +1) +N +4 2] 1\ (60) 
i 4c ach C i 





which gives the behavior of xv, for large c. 
We write the solution just found as 


i) 1 7 ; 
exn'(#) = Una(t) + 205 2) Br'(Nin) Un msox(t), 
l= =—7 


t= ave. 


The right side of (61) is ordered in powers of c’ when expressed in terms 
of the variable ¢. However, if t = x/c is substituted, the terms are no 
longer so ordered since Uy,m42(av/c)/Un,m(a~/c) = O(c). The range 
of x values for which the first few terms of (61) furnish information 


about ¢y,, vanishes as c gets large. We shall use (61) only forO0 S x S 
1/c*. 


(61) 


To obtain an asymptotic form for gy.n(x) fore * S 


x 
it is convenient to write gy.»(z) = 2” yyn(x) and set y 
Equation (25) now becomes 


<1 - (1/e) 
a eae 


_ an dy it oy. | dy 
a=) % +[2— ew + ayy] # 


= | x —c— (w = 3)(w ae ;) “F ey | y = 0. 


Into this equation, substitute xv,.(c) as given by (60) and set 


y= a a v 
Vy + yn 


LG (2)- 0. 
dy Cc 
rs aNti (1 oat y) necy 
PN,n (x) ies VJ y(1 ae Paes (63) 


joviag ~essas.. 


c 


(62) 


One finds then for », 


Accordingly we write 
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To obtain an asymptotic form for gy,n(2) valid near x = 1, set 
y = s/cin (62) and again use (60) for xv... There results 


dy ldy = 
dy? ° s ds =e (2 ) ak 
Accordingly we write 
enn (t) ~ 2 T,(cy) 


=-Vi-#, 1-(/e)s2< 


where J,(2) is the modified Bessel function. (See Ref. 8, Vol. IJ, p. 5). 
When x > 1, we set z = ~/x? — 1, and have y = iz. The solutions 
gv and gyn’ then give rise to two more asymptotic forms. We write 


(64) 


oven (a) = 2% J,(cz), lseS1+ , (65) 


eicz (1 =— tz)” 1 
c 


5 _ N+t oe OE OR 
YN,n (x) = & Re A/a n" ig) Lape eg. (66) 


= v— i. 


We now determine the joining factors for these five solutions. In 
ovn and gyn’ we set x = u/c! and let c become large for fixed u. One 
finds 


n (2n+N-+4)/4 
(— 1) c yeti Vc/2 


Onin (u/c*) i 


n! 


¢ —(2n+N+4) /4 - 
(2 = u/e) ~ 2 ttn tty ele 
Nn = N+2n+1 


where we have used the fact that 
La"(utse) ~ (=1)"ue" nt 
When y = v/+/c, one finds for fixed v and large c 
eun'ly = 0/Ve) ~ de'V//o 


v/V/e) ~ evo 





I 


oun (Y 





where to obtain this last expression we have used the known asymptotic 
formula I(x) ~ e*/~/2rx (see Ref. 8, Vol. II, p. 86). Finally, when 
z = v/~/c we find 
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v/+/c) ~ 4/2 eevee al) 


enn (2 = v0/vV/e) ~ ao ee a 


where we have made use of the formula (see Ref. 8, Vol. II, p. 85) 
Jz) ~ (2/2) cos (2 — 1/4). 
All these results can be summarized in the following statement: 


enn (2 


ef ety @® (t’), 0 = 7m < a 
gNtiery (1 Zs y)* He e 
kay Nt c'sa<1l-e 
Vy(1 + yy 
Ovin(x) ~ 4 ksx™ "TI, (cy), Poets ea 
kaa 47, (cz), l<¢e¢s1+ec" 


er(l 42)” 


N+} _ & Ot Te) —1 
ksa “* Re Jia + iz) Lee = 


lA 
8 
om 
=P) 
=] 
Ww 


where 


t=aV/e, y= VIl-® 2=V2—1 


N+2n+1 N/24+} — 
(—1)"2 reer en Prt, € 


i= 
n! 
N+2n+3/2 N/24+3 — 
a (—1)"4/72 APO ale EON a ra ae 
3 4 n! 
N+2n+2 N/2+} — 
ke (—1)"2 renee fey Neha ee 
SS 


n! 
is the asymptotic form for large c of a bounded continuous solution of 


(25) belonging to the eigenvalue (60). 
We next calculate the normalization constant 


1 oe 2 
waa ~ |, Graal ae, 


For the contribution due to gy, we find 





i , 

c 1 1 c 
d Bile ahs ec P 2 [ dt —H 2N+I j oe p 2 
[ aele (P= fete.) 


_ i ps du eu [Ln (uy) 
2V/c /0 


= DGt-e Nap) N+2n - Ve 
eG (Ls Ole er) | 


3028 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


where we have used the fact that 


’ —ria a 2 N(n “a a =F 1) 
oe be ae) eS 
i exh" Ce)" dx Tin + 1) 
(see Ref. 5, p. 99). It is not hard to show that the contribution to 1/N yn." 
from integration over the region ¢ * S x < 1 is O(c%e-V“) for some p > 
0. We have then 


Nw 2 20 (n + 1) 


~Tin+N+1)° (68) 


5.3 Asymptotics for n and c Both Large 


The techniques employed here again follow very closely those used 
in Ref. 7. We accordingly give a minimum of detail. 
We assume that when n and c are both large x can be written 


Xvn ~e + 26¢ + bo thife+---. (69) 


The ranges of n and c for which this is valid will appear in the analysis 
to follow. 

In (25) make the substitution « = t/c and replace x by (69). One 
finds 


dy 


= 2 
ee (+is )e+0(2)=0 


and hence for large c, g(t) ~ VWtJ y(t). We write 





ova(t) = VxIn(cx), Oz (70) 


fd 


Returning to (25) with x replaced by (69), we B Observe that the sub- 


stitution 
ex E cx S56 12) 
_ P 2 6 Lae ‘ 


eg = 
V1 — x 


yields 5 — ~ + O (2 1) = 0, so that for large c, v becomes constant. After 





saunas this solution by a complex constant, we take its real part 
for the next section of y. Explicitly we define 


i ee 
5 tee z cos | ex - 5 log a — (N + 3) i (71) 


me V1l-—-2 
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Note that when x = u/+/c and ¢ is large (71) becomes 


Ovin () ~ 2 eos | uve — (N + 3) ale 


The asymptotic formula for Jy (see Ref. 8, Vol. II, p. 85) shows that 
ovina (a = u/c) ~ V2 cos | we — (N + 3) =| 


also, so that gyn’ and gy,’ agree for large c in the neighborhood of 
z= 1/vVe. 

To find an appropriate asymptotic form for ¢ valid near x = 1, 
substitute g = 2*"e"°y into (25) with x given by (69). Now make 
the substitution 2 = 1 — 7€/2c. There results 


du Ce 6 re 1 
tae T GQ= pe G-is)uso() = 0. 
Accordingly, we are led to define 


evn (2) = gtehO—g E — i3s 1; —2ie(1 — 2) | (72) 


where 


oe ae av , aa+1)2 
P(a,b;z) = 1 PR be aD 7) mT 


is the confluent hypergeometric function in the notation of Ref. 8, Vol. 
I, Chap. 6. 

The solution (72) is real. Its asymptotic form for large c when x = 1 + 
v/+/c can be found from the knownf asymptotics for the @ function. 


One finds 
£6 (9/4) 
Ope ee Pe ie 
Onn (x v/Ve) WET TS as 


cos [ove + > log (2vx/c) + 0(6) — “| 
where the real functions R(6) and 6(6) are defined by 


r (; ns if) = R(a)e, (74) 


This latter definition is made precise by requiring 6(6) to be continuous 
with 0(0) = 0. 
Now when x = 1 — v/+/c, (71) shows that 


+ See Ref. 8, Vol. I, p. 278, Eq. (2). 
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1 
y = STDs 
gvin (x = v/V/C) aoe 
(75) 
cos| ove + Sloe 5 7 + (Ni 4s 2) 2 e|). 
2° Q/e ae” 
Comparison of this expression with (73) shows that gy.,’ and (— Lt 
R(d)e" “on n'/2+/m are asymptotically the same for x = 1 — v//c 
provided 
c+ 6 log(2V/c) — 0(8) = (N +.1)(a/2) + mq (76) 
with q an integer to be determined shortly. 
Quite analogous to (71) is - solution for x > 1, 


77> (Ww +08- 
VJ xe Vx — 1 


When « = 1 + v/v/c and ¢ is large, this solution becomes 
e5(m/2) 


onan (x =1+0/Ve) ~ a Pe ieee 





gtr!) cos Ene — = 5 log = 





onin (2) = 


coslo+ ove — 2 5 oe 5 - w+1 3-7], 


Comparison with (73) shows that this is the same as (—1)"(R(5)e"""/2- 
V/ ren n(a = 1+ v/+/c) when account is taken of (76). 
Our results thus far can be summarized as follows: 


(~/xJ v (cx), O<sxsce° 
oa a : ae 
z eos| ex 5 log ea — (N + 4) 5 
7C V1 — 2 


(- 1) "R (8) eur ght ic(1—2) 


Pere . ave (78) 


1 6 ; 
| eer ee : 
® E ts, 1; —2ic(1 2) | : 


1 


lx -—-1|S¢%* 


gbir/2) COS E — 5 log 2 ae ; — (N + 1) 5 _ =| 


/ xe V/x2 —1 





ez1itc? 
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is the asymptotic form for large n and c of a continuous solution of 
(25) provided 6 and q are chosen to satisfy (76) and the requirement 
that ¢ as given by (78) has n zeros in the open z-interval (0,1). The 
corresponding eigenvalue is given by xv ~ ¢ + 26¢ + O(1). Higher- 
order terms can be found by methods analogous to those presented in 
Ref. 7. 

When c becomes large and 6 remains fixed, ie., 6 = O(1), the number 
of zeros of gyn(x) inO < x S 1 can be estimated roughly from (78). 
Using the asymptotic expansion for Jy , we find that yy,,.°(x) contributes 


% = (We/r) + O(1) 


zeros as x ranges from zero to 1/+/c. From gyn (x) we find 
2 = (1/r)[e — 2Vc + (8/2) log We] + O(1) 

zeros for 1/-\/c S$ x S 1 — 1/W. Finally, by using the asymptotic 
form (73) for gyn, the number of zeros of ¢ for 1 — 1/1/e S$ x < 1 
is estimated as 

2 = (1/r)[Ve + (8/2) log Vc] + O(1). 
Since we must have n = 2, + 2 + 2, the last three equations show 
that 

nr = c+ 8 log 2v/ce + O(1). 

Combined with (76) this implies that asc > «, 

0(8) + (N + 1)(a/2) + 2q — nw = O(1). (79) 


The equation just established can be used to obtain a limiting result. 
Let N be fixed and suppose that n grows with c according to 


n = (1/x)[e + b log (2Vc)] (80) 


where b is a fixed number (independent of c). Multiply this equation 
by z, add to (76) and rearrange to obtain 


(6 — b) log (2v/e) = (6) + (N +1)(a/2) +g — ne = O(1) (81) 


where the last equality comes from (79). Divide (81) by log(2~/c). We 
then obtain the limit result: zf n grows with c according to (80), then 


lim 6 = b. (82) 


Crew 
VI. ASYMPTOTICS OF Yy n AND An,n 


6.1 Fixed N and n. Large c 


The asymptotic solution $y .(z) given in (67) has the values 
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én .n(1) => ks 
cos [ex — (N + 1) (r/2) — (4/4) . 


évn(u > 0) ~ (—1)"ks5 - 


On recalling the definition given in (49), we see that for large c 
onn* (2) ~ (-1)"Gy,n(x)/cks , 
so that for fixed n and N as c becomes large 
(=1)"ks fz 
* RIF ———_————_— = —_ — 
ext) wt = (- 4/Z. (83) 


Equation (50) then gives 





(=1)" | 


Se (84) 


YNin ~~ 


We now proceed to use (84) and the useful formula (to be established ) 








OYN,n _ YN wn 2 = 
Len — Te bye (1) — 1] (85) 
where 
1 
[evn (a) de = 1 (86) 


to get a much stronger statement regarding the asymptotic behavior 
of yy. First we establish (85)-(86). 
For simplicity of notation let us write (45) as 


Ynen(a) = if K (cax’)en(x') dx’ (87) 


where we have suppressed dependences on N. Differentiating, we find 


d¢,(x) 
0c 





OYn 
5 Pie) + Yn 
; ; dyn (2") 
= | va! K' (exx')on(x') dx’ + { K (cxz’) aE dx’. 
0 0 


Differentiating (87) with respect to x gives 


1 
Yen (ae) = 2 [ 22x’K' (cox!) o,(a!) de’, 
“ 0 


so that (88) becomes 
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0 D7 nm / n 
2 Ca) + 4p MH) = Eapst(a) +f K(oxa!) So g 


Multiply this equation by ¢,(.) and integrate. One finds 


rs On 
aa Pn *(a)de tm f Gn(2) —— a 
12d 


= "(a) dx + i: Gy) Bena") ay! 
Yn 0 2e dz Pn Yn ; Pn ae ) 


where the last term has been obtained by interchange of orders of inte- 
gration and use of (87). Equation (85) then follows by integrating the 
first term on the right by parts and by using (86). 

Touse effectively (85)—(86) itis convenient to introduce xy,n = (—1)": 
/cyv.n. We then have 


1 OKy.n = 


Ee 
end ac ¢ PNin (1) ; (89) 
= 1 








(90) 


C>0O 


from (85) and (84) respectively. From (67) and (68) we see that 
Oe tte A rante 

Pat bDTra+N+Fi)" 

Using this expression in (89) and integrating, we obtain 


gopranss e) 7 
N-+2 —2t 
| pena ae. 


Tin+1)lrm+N+1) J 
Integrating by parts and using (90), we finally find 


7 (—1)" _ (= 1)"92 eNeranns gh tae ye | () 


In terms of d of (13), we find from (17) and (19) 


ov,n (1) om ks'Nwyin 


log kw,n |e = 


Ann = CYN sn (92) 
so that 
go Ter tl ee | (2) 


6.2 Fixed N and n. Small c. 


We use (46) to obtain an expression for yw,» for small c. From (42), 
it follows that 2 4 
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do'"(c) = ps A_,(Nyn)e" = A_,"(Nn)e"[1 + O(e)] 


(94) 
BT (In + N+ 1)TQn + N +2) 
where we have used (124), (26) and (82). From (42) one has 
>, aj" = di” + O(c) = 1+ O(c’). (95) 
7=0 
Equations (94), (95) and (46) now give 
Yvon (-—1)"T(n + 1)T(n + N + 1) 2ntN+h (96) 





~ RNAP (n - N+ 1)FQn +N + 2) 


Higher-order terms could be obtained in a similar manner. An alterna- 
tive route, however, is to use (85) and (86). From (39) and (30), one 


sees that [22n + N + Ll} ¢ = .) (—1)"[1 + O(c‘)] is the normaliza- 
tion factor for (89). Using (39) one then finds for a normalized solution 


ewn(l) = (-1)"V2Gn +N +1) 
N’e ; 
I 7 Te EN EST ae 
Inserting this expression in (85) and integrating, we find 
— (-1)'Ta@+ Dim + N+ 1ertxt 
— 22tNHIT (In + N + 1)T(Qn + N + 2) 


N°’? 4 
[ + ian + NV} Gn FN apt OY )|. 





YNin 
(97) 





6.3 Asymptotics for n and c Both Large 
To obtain an expression for yy,n valid for n and ¢ both large, we use 
(77) and (49-50). For the asymptotic solution (77) we have 
R(5)e5/)(—1)4 


and for very large x 


er!) cos [ex — (N + 1)(r/2) — (r/4)] 


~v.n(t 0 ) om V/ xe . 
WTC v 





Comparison with (49) shows that gyn” = Wxr/ce enn, and (98) 
and (50) now give 
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—1)9R(6)e—3(4/4) 
Ynin™ ( a . (99) 


Now (91) and (97) show that for large and small c the sign of yv,n 
is the same as the sign of (—1)”. As c varies, yv,n cannot change sign, 
for by (92) if yv,n were to vanish for some value of c ¥ 0, so would 
Aw.n. Since, as we have noted in Sections II and III, the kernel K, of 
(12) is positive definite, this is impossible. We can therefore replace 
q by n in (99) and we have 


q = n(mod 2). (100) 
From the definition (74) of R(é6), one has 


[R(s)?) = 7 G + is) r (G _ i’) = Ge (101) 
Here we have used the functional relation [Ref. 8, Vol. I, p. 3, Eq. (7)] 
for the gamma function 
(4 + 2)P(4 — 2) = wsee zz. 
Equations (99), (100), (101) and (92) combined are 
ee a 
Finally from (80), (82) and (102) we have the limiting result: if 
n = [(1/r)(c + b log 2V/c)] 


where the brackets denote “largest integer in” and 6 is a fixed number, 
then 


, 1 
lim Ay,n food i+e 





(103) 


VII. THE CASE D > 2, R THE UNIT SPHERE 


In the previous sections, we have treated the important special case 
D = 2, FR the unit circle in considerable detail. Most of the analysis there 
was concerned with solving the integral equation (20). Fortunately, as 
we shall now see, the solution of that equation also affords a complete 
solution of the case R the unit sphere centered at the origin in Ep, 
D = 3,4, ... . In treating this general case, we shall draw freely on the 
theory of D-dimensional spherical harmonics as given, for example, in 
Ref. 8, Vol. II, Chap. XI. We follow the notation of this work and set 


D=p+2, p=$1,2,.... (104) 
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Let x = rand y = r’'n where — and n are unit vectors in F142 . Equa- 
‘tion (8) now becomes 


av(r,é) = [ dr’? if exp (terr’E-n)W(7’,n) dQ(n) (105) 


where © is the surface of the unit sphere in L4. . 
Now let 


MW) = (N + p) NERD! 


and let Sy'(z), 1 = 1,2, --- , h(N,p), be a complete set of orthonormal 
surface harmonics of degree N. The Funk-Hecke theorem (Ref. 8, Vol. 
II, pp. 247-248) asserts that 


N =0,1,2,..., (106) 


[ exp (ierr'£-n)Sy'(n) dQ(n) = Hy(err’) Sw! () (107) 
where 


Hy(err’) 


( ee 1 
Se a OO) 
r (?#: ) (N+ p—1)!7 


is independent of | and Cy’(uw) isa Gegenbauer polynomial (Ref. 8, Vol. 
IT, p. 235). By expanding y-in. surface harmonics, 


o h(N,p) 


W(r,E) = 2) Dd) Rwalr)Sw'(), 
N=0 Il=0 
we find from (105) and (107) 
1 
avaRvar) = [der Hy (err Reale’) (109) 
0 
from which it is seen that Ry,1(r) and ay,; are independent of J. We have 


the expected degeneracy of eigenvalues due to spherical symety 
Now [Ref. 8, Vol. IT, p. 236, Eq. (25)| 


Cy? (u) ok (- 1 (p) (1 _ ee a” ~(1— i ee i a 


2v (p +1 | 
(25 ), 


where (a)y = a(a + 1)--- (a +N — 1), 80 that from (108) 





? 


’ Py alata ty Oe Wd iz ierr’ uy d™ 2\ N+(p-1) /2 
ee aay aoa alee aoe | due aan wu) . 
rp (22 \or (PTs 
2 2 Jw 
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Integration by parts gives for the integral here 


(=4)" [ du(1 $s yeyrtonnie a glenn 
= du 


1 
= (—ierr’)” | du gory =) ig is i 
Sy 


(—1) "Var (w + po ‘) NPC!) PT wapplerr’) 
where we have used the Poisson formula 
1 
P+ DIAz) = e/2)" fe =e) du 
= 


[Ref. 8, Vol. II, p. 81, Eq. (7)]. We have then, finally 
Hy(orr’) = i% (29)? Iva pelorr’)/(err’)?”. 


We see now from (109) that the eigenfunctions and eigenvalues of 
(105) are 


Wy ,tn(7,€) = Ry alr) 8w'(¥), l= 1, 2, tee ,h(N,p) 
cnn = 1 (20) By .n (110) 
Nw = 0; 1,:2;..4. 
where 
a ie (err) 1 
Bultnat Sf ee a ) ar (111) 


a: (err) 2 
These equations are the analogues of (16), (17) and (18). Set 
y= Bere g = pPtDep (112) 


Equation (111) becomes 


1 
0) = I Tuspnlert’)/erre(r’) ar’. (113) 


This, however, is (20) with N replaced by N + p/2. The formulae of 
Section IV for the solutions of (20) can be taken over exactly replacing 
N by N + p/2 throughout. (Expressions involving factorials must be 
replaced by the appropriate ones in terms of I’ functions when p is an 
odd integer.) Together with (110), (111) and (112), they provide solu- 
tion of (105) for all D = 2. 

It is interesting to note that the one-dimensional case treated in Refs. 
1 and 2 can be obtained as a special case of the present theory by ap- 
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propriate interpretation. The parameter N of this section is the degree 
of the homogeneous polynomial solution to Laplace’s equation in D 
dimension afforded by the spherical harmonic Sy’ when expressed in 
rectangular coordinates. When D = 1, Laplace’s equation d'y/dx” = 0 
has only two homogeneous solutions, Y = k and y = 2, respectively of 
degrees zero and one. For D = 1, ie.,p = —1 from (104), we have 
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Fig. 1 — Curves of xw,n of (25) vs c. 
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only two allowed values, VN = 0 and N = 1. The quantity N + p/2 
occurring in (113) then has values —% and 4. The kernel becomes 
/2/x cos crr’ and +/2/x sin err’ respectively in these two cases, and we 
retrieve the integral equations for the even and odd prolate spheroidal 
functions of zero order. Note that when N = -43, (25) reduces to the 


prolate spheroidal equation. 


VIII. NUMERICAL RESULTS 


A program for the IBM 7090 has been written to compute generalized 
prolate spheroidal functions using formulae (40) and (44). Trial values 
for the xv,. were obtained from (34) and (55) and the recurrences 
(36)-(37) and (59). The method of Bouwkamp* was then used to cor- 
rect: these estimates and obtain the d;*’”. Values of yy, were obtained 
from (46) and these were converted to values of } by Ayn = CYn.n’- 

Fig. 1 shows plots of xv. versus c. Fig. 2 gives the behavior of the 
first few Awy,, . By definition of the labels, xv.n41 = xv.» for Njn = 0, 1, 

. and if c > 0 the inequality is strict. From Sturmian theory, it follows 
that xv41n. > xw,.- For the \’s, one can show correspondingly that 
Anjnti < Ann and Aw4inn < Ann for Njn = 0,1, .... The problem of 
ordering the \’s and x’s for all N and n appears to be a difficult one. Some 
values are listed on Table I. 

Figs. 3 and 4 show plots of gy,n(x) versus x for N = 0,2,n = 0,1,2,3 
and ¢ = 2,10. Values of the gy,» for a larger set of parameter values are 
given in Table II. Normalization is as in (86). 
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Fig. 2— Curves of \w,n of (13) and (15) vs c. 
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Fig. 3 — Some generalized prolate spheroidal functions, gy,n(x). 
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Fig. 4 — More generalized prolate spheroidal functions, enn (2). 
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TasBLE I — NuMeERICAL VALUES OF xywy.n AND Ayn 


x 





x 


N=0 n=0 
7.5499895 — 1 2.4968775 
8.7434899 — 1 6.0585348 
1.2395933 + 0 2.2111487 
1.8225178 + 0 4.2951906 
2.5857968 + 0 6.2963045 
4.4622709 + 0 8.8705036 
6.5208586 + 0 9.7495117 
8.5869176 + 0 9.9534230 
1.8690110 + 1 9.9999957 

N=0 n=1 
9.2562398 + 0 1.0829815 
1.0847476 + 1 6.7214485 
1.38698728 + 1 6.6745424 
1.7898720 + 1 2.6742780 
2.3241561 + 1 5.7877057 
2.9277622 + 1 8.3060712 
3.5550580 + 1 9.4973850 
4.1805821 + 1 9.8782700 
4.7985976 + 1 9.9738554 
5.4108072 + 1 9.9947801 

N=0 n=2 
2.5751488 + 1 1.8834675 
2.6773866 + 1 1.9235204 
3.8241737 + 1 1.6017987 
4.4846367 + 1 8.1254764 
5§.3021146 + 1 2.5847455 
6.2527715 + 1 5.3544699 
7.2854528 + 1 7.8635574 
8.3461406 + 1 9.2600949 
9.4019226 + 1 9.7915064 
1.0443896 + 2 9.9484586 
1.147438138 + 2 9.9882732 
1.2496987 + 2 9.9974793 
1.3514611 + 2 9.9994820 
1.4528810 + 2 9.9998984 

N=0 n= 
4.9250694 + 1 6.0066949 
5.0761114 + 1 9.8333952 
6.1688709 + 1 3.5422330 
7.4995083 + 1 3.5278392 
8.3823340 + 1 2.0130790 
9.4336396 + 1 8.2918248 
1.0659367 + 2 2.4212641 
1.2034708 + 2 4.9658387 
1.3504432 + 2 7.4660703 
1.5007176 + 2 9.0244395 
1.65024389 + 2 9.6944048 
1.7977291 + 2 9.9165294 
1.94383894 + 2 9.9791376 
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7. 
8. 
9. 
-0867089 + 
.2145589 + 
.3409696 + 
-4657506 + 
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ot bak et et bt et et TOD OD 
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9163765 
4119661 
3394615 
3427678 
3086855 
7170130 + 
1310500 +- 
7555900 + 


— 


+ 
+ 
+ 
+ 
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N=1 
6255011 + 


7912353 + 


4832293 + 
0401459 + 
7326440 + 
5219234 + 
3565692 + 
1976089 + 
0297509 + 


N=1 


6265101 + 
7820310 +- 
9160037 + 
5464880 ++ 
3286568 + 
2759605 + 
3789365 + 
5955815 + 


N=1 


.4258409 ++ 
.5789319 + 
.6749767 + 
-0834214 + 
.2001776 + 
.3350648 +- 
-4872078 + 
.6522672 + 
.8237982 + 
-9961900 -++ 
2. 
2. 


1666412 +- 
3347382 + 


a pe pe 


NNNNR RR eRe Ree 8 


n 
1 
1 
1 
2 
2 
2 
2 
2 
2 
2 
2 
2 


9 .4982658 
1.3986168 
1.6123183 
4 .8326866 
7.8473505 
9.3671678 
9 .8534266 
9 .9998314 


=1 


7.4672551 
1.8549511 
3.8313651 
1.6818804 
4 .2912557 
7.1473948 
8 .9618892 
9.7041388 
9 .9279210 


= 2 


4 .6976877 
1.9082396 
1.0305314 
8.3652966 
44641026 
1.6080875 
3.9082845 
6.6399691 
8.6120123 
9.5495434 
9 .8759352 
9.9692509 


= 3 


7 .6540787 
4.9988893 
1.1190662 
1.0298512 
4.5573797 
1.5017502 
3.5789707 
6 . 1969392 
8 .2818634 
9 .3866345 
9.8150785 
9 .9500835 
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N=2 n=0 N=2 n=2 

1 9.4976317 + 0 3.9517707 — 4 1 4.9292042 + 1 1.0614368 — 14 
2 1.1710916 + 1 1.9088335 — 2 2 5.0922802 + 1 1.7074109 — 10 
3 1.5291960 + 1 1.3627864 — 1 5 6.2566028 + 1 5.5396960 — 5 
4 2.0048498 + 1 4.0298411 — 1 7 7.6509011 + 1 5.0165514 — 3 
5 2.5667098 + 1 7.0317221 — 1 8 8.5676381 + 1 2.7077307 — 2 
6 3.1747966 + 1 8.9417071 — 1 9 9.6519700 + 1 1.0455606 — 1 
7 3.7952889 + 1 9.7092917 — 1 10 1.0904728 + 2 2.8443961 — 1 
8 4.4125829 + 1 9.9324606 — 1 11 1.2295597 + 2 5.4626700 — 1 
10 5.6324064 + 1 9.9972026 — 1 12 1.3768257 + 2 7.8216583 — 1 

13 1.5265485 + 2 9.1919613 — 1 

14 1.6752489 + 2 9.7528766 — 1 

15 1.8220066 + 2 9.9334500 — 1 

N=2 n=1 N=2 n=838 

1 2.5333581 + 1 4.1659113 — 9 1 8.1275289 + 1 9.2115325 — 21 
2 2.7088321 + 1 4.0530517 — 6 2 8.2854667 + 1 2.3940665 — 15 
5 3.9788041 + 1 2.5787154 — 2 5 9.4065006 + 1 3.2594658 — 8 
6 4.6842565 + 1 1.1592915 — 1 10 1.3679420 + 2 5.8064778 — 3 
7 §5.5371030 + 1 3.2573165 — 1 11: 1.4973956 + 2 2.6682390 — 2 
8 6.5067655 + 1 6.0754452 — 1 12 1.6450899 + 2 9.5057491 — 2 
9 7.5425367 + 1 8.3161633 — 1 13 -1.8113152 + 2 2.5380401 — 1 
10 8.5969115 + 1 9.4455397 — 1 14 1.9931616 + 2 4.9785798 — 1 
11 = 9.6442775 + 1 9.8484078 — 1 15 2.1846670 + 2 7.3845601 — 1 

16 2.8793070 + 2 8.9389256 — 1 

17 2.5727411 + 2 9.6462917 — 1 

18 2.7635393 + 2 9.8968676 — 1 

TaBLE II — VALUES OF ¢y,n(z) 
N=0 n=0 
x c=1 c=2 c=5 c = 10 

0.1 4.74638 — 1 5.55421 — 1 9.15662 — 1 1.31455 + 0 
0.2 6.68776 — 1 7.74706 — 1 1.22032 + 0 1.62247 + 0 
0.3 8.14070 — 1 9.27095 — 1 1.35165 + 0 1.57689 + 0 
0.4 9.31948 — 1 1.03607 + 0 1.35103 + 0 1.30428 + 0 
0.5 1.03044 + 0 1.11011 + 0 1.24626 + 0 9.31637 — 1 
0.6 1.11351 + 0 1.15353 + 0 1.06660 + 0 5.703825 -— 1 
0.7 1.18341 + 0 1.16921 + 0 8.43474 — 1 2.91331 — 1 
0.8 1.24157 + 0 1.15957 + 0 6.07845 — 1 1.17077 — 1 
0.9 1.28896 + 0 1.12695 + 0 3.86969 — 1 3.19741 — 2 
1.0 1.32627 + 0 1.07383 +0 * 2.01532 — 1 3.00159 — 3 
1.1 1.35405 + 0 1.00285 + 0 6.38588 — 2  —1.09501 — 3 
1.2 1.37278 + 0 9.16840 — 1 —2.24980 — 2 4.23236 — 4 
1.3 1.38285 + 0 8.18791 — 1  —6.18395 — 2 6.58696 — 4 
1.4 1.38464 + 0 7.11797 — 1  —6.43066 —2 —2.21883 — 4 
1.5 1.37853 + 0 5.98995 — 1 4.32142 —2 —5.95391 — 4 
1.6 1.36489 + 0 4.83499 —1 —1.22744 —2 —1.21194 — 4 
1.7 1.34410 + 0 3.68328 — 1 1.68342 — 2 4.20886 — 4 
1.8 1.31655 + 0 2.56332 — 1 3.61314 — 2 3.79898 — 4 
1.9 1.28264 + 0 1.50130 — 1 4.20554 — 2 —8.00341 — 5 
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N=0 n=0 
x c=1 c=2 c= 5 c= 10 
2.0 1.24281 +0 " 5.20429 — 2 3.52268 — 2 —3.67577 — 4 
2.1 1.19748 + 0 —3.59472 — 2 1.94309 — 2 —2.42458 — 4 
2.2 1.14713 + 0 —1.12245 -— 1 1,30129 — 4 1.44876 — 4 
2.3 1.09221 + 0 —1.75666 — 1 —1:71274 — 2 3.08117 — 4 
2.4 1.03322 +- 0 —2.25459 — 1 —2;:80628 — 2 1.76068 — 4 
2.5 9.70655 — 1 —2.61300 — 1 —3.05323 — 2 —1.11428 — 4 
2.6 9.05010 — 1 —2.83289 — 1 —2.47697 — 2 —3.05239 — 4 
2.7 8.36800 — 1 —2.91927 — 1 —1.30522 — 2 —6.14275 — 5 
2.8 7.66537 — 1 —2.88082 — 1 1.07933 — 3 1.53741 — 4 
2.9 6.94735 — 1 272956 — 1 1.38275 — 2 6.59394 — 5 
3.0 6.21906 — 1 —2.48005 — 1 2.20781 — 2 1.15331 — 4 

N=0 n=1 
x c=1 i c=2 c=5 c= 10 
0.1 7.57682 — 1 7.49125 — 1 7.51850 — 1 1.11517 + 0 
0.2 1.00189 + 0 9.78062 — 1 8.84348 — 1 9.64135 — 1 
0.3 1.08562 + 0 1.03485 + 0 7.51864 — 1 2.39045 — 1 
0.4 1.02660 + 0 9.38690 — 1 4.08540 — 1 —6.68891 — 1 
0.5 8.24694 — 1 6.94825 — 1 —7.50154 — 2 —1.35733 + 0 
0.6 4.76162 — 1 3.08610 — 1 —6.10130 — 1 —1.58691 + 0 
0.7 —2.29984 — 2 —2.11235 -— 1 —1.10302 + 0 —1.36404 + 0 
0.8 ' —6.75867 — 1 —8.51754 — 1 —1.47075 + 0 —8.87374 — 1 
0.9 —1.48414 + 0 —1.59571 + 0 —1.65550 + 0 —4.07419 — 1 
1.0 —2.44790 + 0 _—2.41297 + 0 —1.63388 + 0 —9.25172 — 2 
1.1 —3.56556 + 0 —3.30607 + 0 —1.41956 + 0 2.63841 — 2 
1.2 —4.83382 + 0 —4,.22107 + 0 —1.05901 + 0 2.11679 — 2 
1.3 —6.24769 + 0 —5.13836 + 0 —6.20954 — 1 —1.29202 — 2 
1.4 —7.80051 + 0 —6.02868 + 0 —1.82563 — 1 —2.10428 — 2 
1.5 —9.48401 + 0 —6.86308 + 0 1.85537 — 1 —2.91179 — 3 
1.6 —1.12884 + 1 —7.61390 + 0 4.31871 — 1 1.47146 — 2 
1.7 —1.32026 + 1 —8.25574 + 0 5.32289 — 1 1.36618 — 2 
1.8 —1.52139 + 1 —8.76628 + 0 4.92313 — 1 —1.338175 — 3 
1.9 —1.73086 + 1 —9.12711 + 0 3.438349 — 1 —1.28433 — 2 
2.0 —1.94720 + 1 —9.32424 + 0 1.34214 — 1 —9.24495 — 3 
2.1 —2.16882 + 1 — 9.34869 + 0 —8.02848 — 2 2.21756 — 3 
2.2 —2.39406 + 1 —9.19674 + 0 —2.51019 — 1 1.09934 — 2 
2.3 —2.62120 + 1 —8.87007 + 0 —3.44099 — 1 7.19680 — 3 
2.4 —2.84846 + 1 —8.37588 + 0 —3.46476 — 1 —2.21224 — 3 
2.5 —3.07405 + 1 —7.72616 + 0 —2.66729 — 1 —6.65003 — 3 
2.6 —3.29612 + 1 —6.93789 + 0 —1.31077 — 1 —7.12408 — 3 
2.7 —3.51286 + 1 —6.03209 + 0 2.39279 — 2 8.00041 — 4 
2.8 —3.72245 + 1 — 5.03334 + 0 1.60448 — 1 8.27644 — 3 
2.9 —3.92310 + 1 —3.96899 + 0 2.48208 — 1 1.58822 — 3 
3.0 — —4.113808 + 1 —2.86785 + 0 2.70290 — 1 —1.20582 — 3 

N=0 n=2 
x c=1 c=2 c=5 ce = 10 
0.1 9.39351 — 1 9.35161 — 1 8.84844 — 1 9.05937 — 1 
0.2 ~ 1.08208 + 0 1.06262 + 0 9.01678 — 1 5.16232 — 1 
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N=0 n=2 

c=1 = 2 c=5 c = 10 

.66349 — 1 8.22287 — 1 5.05390 — 1 —3.42126 — 1 
.64401 — 1 2.96652 — 1 —1.35886 — 1 —9.82679 — I 
.05506 — 1 —3.80932 — 1 —7.81957 — 1 —8.93819 — 1 
.56032 — 1 —1.00914 + 0 —1.16120 + 0 —7.44970 — 2 
.32240 + 0 —1.31928 + 0 — 1.03567 + 0 9.85253 — 1 
.05832 + 0 —9.83017 — 1 —2.64525 — 1 1.65961 + 0 
.65717 — 1 3.74968 — 1 1.15147 + 0 1.60911 + 0 
.16217 + 0 3.16046 + 0 3.05638 + 0 9.70106 — 1 
22453 + 0 7.79020 + 0 5.16083 + 0 2.03097 — 1 
.61238 + 1 1.46703 + 1 7.09600 + 0 —2.55403 — 1 
760385 + 1 2.41735 + 1 8.48753 + 0 —2.68864 — 1 
34731 + 1 3.66166 + 1 9.038589 + 0 -—2.22585 — 2 
.46001 + 1 5.22386 + 1 8.58154 + 0 1.83653 — 1 
.19010 + 1 7.11812 + 1 7.14579 + 0 1.76198 — 1 
.26331 + 2 9.34716 + 1 4.93120 + 0 9.12279 — 3 
.68874 + 2 1.19010 + 2 2.28768 + 0 —1.36988 — 1 
.20527 + 2 1.47564 + 2 —3.55354 — 1 —1.39402 — 1 
.82291 + 2 1.78757 + 2 —2.57167 + 0 —1.18170 — 2 
.55155 + 2 2.12083 + 2 —4.02160 + 0 1.05884 — 1 
.40084 + 2 2.46904 + 2 —4.51700 + 0 1.188382 — 1 
.38002 + 2 2.82468 + 2 —4.05482 + 0 1.72440 — 2 
.49779 + 2 3.179383 + 2 —2.81239 + 0 —8.73616 — 2 
.76218 + 2 8.52375 + 2 —1.10632 + 0 —7.36407 — 2 
.18088 + 2 3.84832 + 2 6.76904 — 1 —2.63630 — 2 
.07586 -+ 3 4.14324 + 2 2.16111 + 0 3.13871 — 2 
.25021 + 3 4.39892 + 2 3.05658 + 0 9.47047 — 2 
.44146 + 3 4.60641 + 2 3.21902 + 0 7.913812 — 3 
.64988 + 3 4.75731 + 2 2.66304 + 0 —5.44166 — 2 

=0 n=3 

c=1 c=2 c=5 c= 10 

.04335 + 0 1.03904 + 0 1.00396 + 0 8.16002 — 1 
-41806 — 1 9.22225 — 1 7.82101 — 1 2.80879 — 1 
91804 — 1 2.54987 — 1 1.09411 — 2 —§.85742 — 1 
66503 — 1 —6.04220 — 1 —8.19616 — 1 —9.01249 — 1 
16122 + 0 —1.17035 + 0 —1.15387 + 0 —2.96465 — 1 
.045387 + 0 —1.00493 + 0 —6.55699 — 1 7.15663 — 1 
94405 — 2 3.87818 — 3 5.00576 — 1 1.12898 + 0 
.23639 + 0 1.27745 + 0 1.45426 + 0 2.99477 — 1 
.17030 + 0 1.12044 + 0 6.85065 — 1 —1.44767 + 0 
-74163 + 0 —3.74119 + 0 —3.72277 + 0 —2.98825 + 0 
.94508 + 1 —1.86263 + 1 —1.36118 + 1 —3.23654 + 0 
.51898 + 1 —5.13331 + 1 —3.01767 + 1 —1.98417 + 0 
24365 + 2 —1.12506 + 2 —§.34231 + 1 —6.65948 — 2 
.45530 + 2 —2.15895 + 2 —8.18229 + 1 1.273815 + 0 
.43417 + 2 —3.78480 + 2 —1.12269 + 2 1.31011 + 0 
.50023 + 2 —6.20420 + 2 —1.40431 + 2 3.14866 — 1 
.20572 + 3 —9.64812 + 2 —1.61419 + 2 —7.44769 — 1 
.86039 + 3 —1.43724 + 3 —1.70733 + 2 —1.00264 + 0 
.77451 + 3 —2.06513 + 3 —1.65204 + 2 —4,10716 — 1 
.02027 + 3 —2.87683 + 3 —1.438828 + 2 4.24001 — 1 
.68257 + 3 —3.90062 + 3 —1.08200 + 2 8.04224 — 1 
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N=0 n=3 
x c=1 c=2 e=5 c= 10 
2.2 —7.85998 + 3 —5.16340 -+ 3 —6.24706 + 1 4.53989 — 1 
2.3 —1.06656 + 4 —6.68939 + 3 —1.27832 + 1 —2.40332 — 1 
2.4 —1.42276 + 4 —8.49881 + 3 3.37407 + 1 —6.90334 — 1 
2.5 —1.86902 + 4 —1.06060 + 4 7.02754 + 1 —3.30234 — 1 
2.6 —2.421385 + 4 —1.30186 + 4 9.15596 + 1 1.36373 — 1 
2.7 —3.09743 + 4 —1.57357 + 4 9.49226 + 1 2.58923 — 1 
2.8 —3.91655 -+ 4 —1.87469 + 4 8.08334 + 1 4.73302 — 1 
2.9 —4,89962 + 4 —2.20322 + 4 5.29160 + 1 —7.99752 — 3 
3.0 —6.06911 + 4 —2.55591 + 4 1.70562 + 1 —4.71948 — 1 

N=1n=0 
x c=1 c=2 c=5 c= 10 
0.1 6.67799 — 2 7.82376 — 2 1.75066 — 1 3.92683 — 1 
0.2 1.88413 — 1 2.19147 — 1 4.70668 — 1 9.77051 — 1 
0.3 3.44707 — 1 3.96102 — 1 7.93680 — 1 1.443897 + 0 
0.4 5.27637 — 1 5.96043 — 1 1.08116 + 0 1.62479 + 0 
0.5 7.31901 — 1 8.08692 — 1 1.28498 + 0 1.49159 + 0 
0.6 9.53337 — 1 1.02496 + 0 1.387488 + 0 1.13769 + 0 
0.7 1.18837 + 0 1.23655 + 0 1.34097 + 0 7.138592 — 1 
0.8 1.438380 + 0 1.48586 + 0 1.19359 + 0 3.52409 — 1 
0.9 1.68661 + 0 1.61605 + 0 9.60016 — 1 1.21545 — 1 
1.0 1.94898 + 0 1.77112 + 0 6.78651 — 1 1.738276 — 2 
1.1 2.20321 + 0 1.89606 + 0 3.91685 — 1 —6.46094 — 3 
1.2 2.46172 + 0 1.98689 +- 0 1.37762 — 1 —6.17021 — 4 
1.3 2.71702 + 0 2.04079 + 0 —5.42516 — 2 4.36117 — 3 
1.4 2.96675 + 0 2.05612 + 0 —1.69153 — 1 1.77204 — 3 
1.5 3.20863 + 0 2.03244 + 0 —2.06531 — 1 —2.50040 — 3 
1.6 3.44053 + 0 1.97051 + 0 —1.79111 -— 1 —2.92887 — 3 
1.7 3.66041 + 0 1.87221 + 0 —1.08794 — 1 5.03281 — 5 
1.8 3.86640 + 0 1.74049 + 0 —2.14572 — 2 2.47164 — 3 
1.9 4.05675 + 0 1.57927 + 0 5.85005 — 2 1.944038 — 3 
2.0 4.22984 + 0 1.39322 + 0 1.12680 — 1 —5.40072 — 4 
2.1 4.38426 + 0 1.18771 + 0 1.31527 — 1 —2.13282 — 3 
2.2 4.51871 + 0 9.68543 — 1 1.15060 — 1 —1.41516 — 3 
2.3 4.63210 + 0 7.41814 — 1 7.15483 — 2 6.26047 — 4 
2.4 4.72350 + 0 5.18706 — 1 1.46127 — 2 1.92158 — 3 
2.5 4.79216 + 0 2.90272 — 1 —4.05001 — 2 7.95172 — 4 
2.6 4.83753 + 0 7.72847 — 2 —8.05860 — 2 —6.86022 — 4 
2.7 4.85923 + 0 —1.19954 — 1 —9.73081 — 2 —7.63503 — 4 
2.8 4.85708 + 0 —2.96762 — 1 —8.86612 — 2 —9.86627 — 4 
2.9 4.83107 + 0 —4,49226 — 1 —5.89386 — 2 1.47764 — 4 
3.0 4.78140 + 0 —5.74265 — 1 —1.69513 — 2 1.50302 — 3 

N=1%n=1 
x c=1 c=2 e=5 c = 10 
0.1 1.78797 — 1 1.86209 — 1 2.26456 — 1 4.69291 — 1 
0.2 4.81623 — 1 4.98465 — 1 5.77313 — 1 1.01896 + 0 
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N=1n=1 
c= c=2 e=5 c= 10 
8.11822 — 1 8.30625 — 1 8.81124 — 1 1.10226 + 0 
1.09167 + 0 1.09943 + 0 1.01214 + 0 5.69582 — 1 
1.24498 + 0 1.22371 + 0 8.87650 — 1 —3.54754 — 1 
1.19004 + 0 1.12484 + 0 4.82831 — 1 —1.21477 + 0 
8.42479 — 1 7.29666 — 1 —1.63619 — 1 —1.61645 + 0 
1.15744 — 1 —2.61055 — 2 —9.56512 — 1 —1.44879 + 0 
—1.07772 + 0 —1.19460 + 0 —1.76308 + 0 —9.06520 — 1 
—2.82516 + 0 —2.81285 + 0 —2.43994 + 0 —3.26420 — 1 
—5.21224 + 0 —4.90037 + 0 —2.86233 + 0 3.12685 — 2 
—8.32175 + 0 —7.45742 + 0 —2.94990 + 0 1.08983 — 1 
—1.22323 + 1 —1.04640 + 1 —2.68284 + 0 2.38728 — 2 
—1.70172 + 1 —1.38798 + 1 —2.10636 -+- 0 —6.19490 — 2 
—2.27431 + 1 —1.76444 + 1 —1.32097 + 0 —6.25964 — 2 
—2.94693 + 1 —2.16792 + 1 —4.62707 — 1 —1.893805 — 4 
—3.72461 + 1 —2.58893 + 1 3.24280 — 1 5.11365 — 2 
—4.61145 + 1 —3.01663 + 1 9.15552 — 1 4.39191 — 2 
—5.61051 + 1 —3.43921 + 1 1.22972 + 0 —3.52397 — 3 
—6.72374 + 1 —3.84424 + 1 1.24232 + 0 —3.87077 — 2 
—7.95192 + 1 —4.21912 + 1 9.87598 — 1 —3.49613 — 2 
—9.29459 + 1 —4.55151 + 1 5.48485 — 1 2.81473 — 3 
—1.07501 + 2 —4.82978 + 1 3.678638 — 2 3.09193 — 2 
—1.23154 + 2 —5.04853 + 1 —4.30799 — 1 3.14921 — 2 
—1.39863 + 2 —5.18371 + 1 —7.57370 — 1 —6.17863 — 4 
—1.57573 + 2 — 5.24330 + 1 —8.84163 — 1 —2.95268 — 2 
—1.76214 + 2 —5.21739 + 1 —8.00250 — 1 — 1.28822 — 2 
—1.95708 + 2 —5.10350 + 1 —5.41499 — 1 —1.34150 — 3 
—2.15961 + 2 —4,90184 + 1 —1.79916 — 1 6.07715 — 3 
—2.36871 + 2 —4.61486 + 1 1.95783 — 1 2.63666 — 2 
N=1%n"=2 
c=1 c=2 c=5 c= 10 
3.17572 — 1 3.23375 — 3.58452 — 1 4.61327 — 1 
7.89038 — 1 7.98456 — 8.45034 — 1 8.84405 — 1 
1.13929 + 0 1.13924 + 1.10027 + 0 6.93291 — 1 
1.16268 + 0 1.13700 9.09831 — 1 —8.13386 — 2 
7.46867 — 1 6.88194 2.60402 — 1 —8.60192 — 1 
—6.419385 — 2 —1.40156 —6.18349 — 1 —9.72981 — 1 
—9.90830 — 1 —1.04501 —1.27959 + 0 —2.01002 -— 1 
—1.44413 + 0 —1.42980 —1.15587 + 0 1.024386 +- 0 
—4.438333 — 1 —3.60476 2.92752 — 1 1.92471 + 0 
3.46336 + 0 3.46067 3.41429 + 0 1.94059 + 0 
1.22761 + 1 1.16998 8.22613 + 0 1.12202 + 0 
2.86165 + 1 2.63729 1.48419 + 1 5.54149 — 2 
5.57952 + 1 4.98022 2.09872 + 1 —5.92374 — 1 
9.78715 + 1 8.45483 2.71147 + 1 —5.45072 — 1 
1.59705 + 2 1.33319 3.15908 + 1 — 5.23392 — 2 
2.46994 + 2 1.98856 3.343813 + 1 3.78477 — 1 
3.663808 + 2 2.83810 3.20166 + 1 4.06284 — 1 
5.25100 + 2 3.90597 2.72591 + 1 7.68751 — 2 
7.31713 + 2 5.21256 1.96583 + 1 —2.68863 — 1 
9.95363 + 2 6.77296 1.02468 + 1 —3.08683 — 1 
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N=1n=2 
x c= 1 c=2 c=5 c= 10 
2.1 1.32612 + 3 8.59566 + 2 4.17984 — 1 —9.67834 — 
2.2 1.73484 + 3 1.06812 + 3 —8.32684 + 0 1.95130 — 
2.3 2.23316 + 3 1.30213 + 3 —1.466138 + 1 2.53304 — 
2.4 2.83336 + 3 1.55981 + 3 —1.76896 + 1 1.16918 — 
2.5 3.54832 + 3 1.83831 + 3 —1.71224 + 1 —9.81521 — 
2.6 4.39188 + 3 2.18379 + 3 —1.33267 + 1 —2.49959 — 
2.7 5.37625 + 3 2.44140 + 3 —7.25156 + 0 —5.82649 — 
2.8 6.51688 + 3 2.75538 + 3 —2.34462 — 1 9.65074 — 
2.9 7.82726 + 3 3.06921 + 3 6.27542 + 0 5.32562 — 
3.0 9.32135 + 3 3.37554 + 3 1.10027 + 1 1.36992 — 

N=1 "=3 
x c=1 c=2 c=5 c= 10 
0.1 4.70292 — 1 4.74792 — 1 5.03663 — 1 5.60653 — 
0.2 1.03640 + 0 1.038944 + 0 1.05094 + 0 9.52509 — 
0.3 1.15261 + 0 1.13873 + 0 1.02612 + 0 4.75116 — 
0.4 5.87098 — 1 5.49119 — 1 2.80849 — 1 —5.29321 — 
0.5 —4.34815 — 1 —4.789438 — 1 —7.50431 — 1 —1.03082 + 
0.6 —1.22557 + 0 —1.23658 + 0 —1.23879 + 0 —3.74929 — 
0.7 —9.49593 — 1 —9.03408 — 1 —5.28507 — 1 8.87232 — 
0.8 5.95761 — 1 6.568382 — 1 1.04428 ++ 0 1.26903 + 
0.9 1.58393 + 0 1.56393 + 0 1.35005 + 0 —3.80514 — 
1.0 —3.99973 + 0 . —3.99876 + 0 —3.98415 + 0 —3.66768 + 
1.1 —2.94218 + 1  .—2.82967 + 1 —2.13760 + 1 —6.63805 + 
1.2 —9.93028 + 1 © —9.31025 + 1 —5.82859 + 1 —7.12958 + 
1.3 —2.54877 + 2 —2.33198 + 2 —1.21470 + 2 —4.48170 + 
1.4 —5.60512 + 2 —5.00166 + 2 —2.14807 + 2 —1.91044 — 
1.5 —1.11157 + 3 —9.66261 + 2 —3.37164 -+ 2 3.09018 + 
1.6 —2.043869 + 3 —1.72812 + 3 —4.80966 + 2 3.47603 + 
1.7 —3.54344 + 3 —2.91000 + 8 —6.31861 + 2 1.24407 + 
1.8 —5.86049 + 3 —4.66630 + 3 —7.70049 + 2 —1.54042 +- 
1.9 —9.32112 + 3 —7.18302 + 3 —8.72930 + 2 —2.73125 + 
2.0 —1.43431 + 4 —1.06778 + 4 —9.19104 + 2 —1.438583 + 
2.1 —2.14517 + 4 —1.53984 + 4 —8.92696 + 2 7.02031 — 
2.2 —3.12971 + 4 —2.16194 + 4 —7.87269 + 2 2.14894 + 
2.3 —4,46725 + 4 —2.96368 + 4 —6.08384 + 2 1.443829 + 
2.4 —6.25326 + 4 —3.97614 + 4 —3.74068 + 2 —1.90745 — 
2.5 —8.60136 + 4 —5.23076 + 4 —1.12896 + 2 —1.17683 + 
2.6 —1.16452 + 5 —6.75841 -+ 4 1.40286 + 2 —1.61055 + 
2.7 —1.55406 + 5 —8.58796 + 4 3.50030 -++ 2 —7.92157 — 
2.8 —2.04672 + 5 —1.07450 + 5 4.86619 -+ 2 1.29119 + 
2.9 —2.66305 + 5 —1.32504 + 5 5.32282 + 2 3.73378 — 
3.0 —3.426383 + 5 —1.61182 + 5 4.83375 + 2 4.32692 — 

N=2 n=0 
x c=1 c=2 c=5 c= 10 
0.1 8.11214 — 3 9.30928 — 3 2.21477 — 2 8.00048 
0.2 4.58035 — 2 5.22724 — 2 1.20067 — 1 4.01321 





het DO 
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N=2 n=0 
x c= 1 c=2 c=5 
0.3 1.25827 — 1 1.42274 — 1 3.07992 — 1 
0.4 2.57171 — 1 2.87029 .— 1 5.71015 — 1 
0.5 4.46741 — 1 4.90296 — 1 8.72575 — 1 
0.6 6.99877 — 1 7.52394 — 1 1.16320 + 0 
0.7 1.02059 + 0 1.07049 + 0 1.39112 + 0 
0.8 1.41171 + 0 1.43889 + 0 1.51294 + 0 
0.9 1.87493 + 0 - 1.84932 + 0 1.50259 + 0 
1.0 2.41089 + 0 2.29129 + 0 1.35656 -+ 0 
1.1 3.01924 -+ 0 2.75254 + 0 1.09462 + 0 
1.2 3.69863 + 0 3.21944 + 0 7.56039 — 1 
1.3 4.44678 + 0 3.67758 + 0 3.919138 — 1 
1.4 5.26054 + 0 4.11222 + 0 5.554384 — 2 
1.5 6.13588 + 0 4.50887 + 0 —2.07467 — 1 
1.6 7.06802 -+ 0 4.85377 + 0 —3.67156 — 1 
1.7 8.05189 + 0 5.138443 + 0 —4.13815 — 1 
1.8 9.07977 + 0 5.34008 + 0 —3.58514 — 1 
1.9 1.01463 + 1 5.46205 + 0 —2.29484 — 1 
2.0 1.12436 + 1 5.49406 + 0 —6.51947 — 2 
2.1 1.23637 + 1 5.48250 + 0 9.353388 — 2 
2.2 1.34985 + 1 5.27657 + 0 2.12220 — 1 
2.3 1.46391 + 1 5.02833 + 0 2.68832 — 1 
2.4 1.57769 + 1 4.69273 + 0 2.57171 — 1 
2.5 1.69026 + 1 4.27721 + 0 1.86697 — 1 
2.6 1.80072 + 1 3.79179 + 0 7.90086 — 2 
2.7 1.90816 + 1 3.24856 + 0 —3.78705 — 2 
2.8 2.01167 + 1 2.661389 + 0 —1.36199 — 1 
2.9 2.11086 + 1 2.04550 + 0 —1.94874 — 1 
3.0 2.20337 + 1 1.41675 + 0 —2.03218 — 1 
N= n=1 
x c=1 c=2 c=5 
0.1 3.01525 — 2 3.18266 — 2 4.36114 — 2 
0.2 1.63412 — 1 1.71721 — 1 2.27802 — 1 
0.3 4.17552 — 1 4.35481 — 1 5.46215 — 1 
0.4 7.63468 — 1 7.87547 — 1 9.08386 — 1 
0.5 1.12468 + 0 1.14274 + 0 1.16775 + 0 
0.6 1.37489 + 0 1.36734 + 0 1.16137 + 0 
0.7 1.33292 + 0 1.28261 + 0 7.56169 — 1 
0.8 7.66020 — 1 6.71043 — 1 —1.09012 — 1 
0.9 —6.15776 — 1 —7.14755 — 1 —1.389595 + 0 
1.0 —3.156838 + 0 —3.14077 + 0 —2.96182 + 0 
1.1 —7.25498 + 0 —6.87847 + 0 —4.57844 + 0 
1.2 —1.33591 + 1 —1.21904 + 1 —5.97349 + 0 
1.3 —2.19659 + 1 —1.93150 + 1 —6.88402 + 0 
1.4 —3.36151 + 1 —2.84518 + 1 —7.11308 + 0 
1.5 —4.88847 + 1 —3.97466 + 1 —6.57336 + 0 
1.6 —6.83847 + 1 —5.32791 + 1 —5.312438 + 0 
1.7 —9.27502 + 1 —6.90514 + 1 —3.50808 + 0 
1.8 —1.22634 + 2 —8.69800 + 1 —1.43922 + 0 
1.9 —1.58698 + 2 —1.06891 + 2 5.66677 — 1 
2.0 —2.01606 + 2 —1.28518 + 2 2.19548 + 0 
2.1 —2.52012 + 2 —1.51503 + 2 3.20680 + 0 


c = 10 


-01985 
38094 
.62971 
54932 
19392 
25292 
16554 
.79148 
.02873 
.46045 
.04085 
.52034 
.29918 
.11917 
.67213 
95354 
.62330 
.50298 
.03792 
19124 
06774 
94735 
.94597 
.04030 
.03146 
—6.15552 
—1.13042 

1.09442 


CO pet et Re et ND OD CO NT et et 


11 
Cr HH OLOO LN CO 


—7 


c= 10 


1.10168 
5.09403 
9.75159 
1.10096 
6.37139 
87398 
22293 
.69212 
.00113 
.45332 
47679 
2.35668 
2.24248 
3.10851 
.65861 
.46461 

2.53438 

1.23063 

1.16626 

3.46739 
—9.50870 
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TaBue II — Continued 





N=2 n=1 
x c=1 c= 2 c=5 c= 10 
2.2 —3.10555 + 2 —1.75404 + 2 3.47937 + 0 —1.00215 — 
2.3 —3.77844 + 2 —1.99703 + 2 3.03214 + 0 —1.01271 — 
2.4 —4,544538 + 2 —2.23824 + 2 2.01698 + 0 7.86142 — 
2.5 —5.40910 + 2 —2.47136 + 2 6.84586 — 1 6.25620 — 
2.6 —6.37687 + 2 —2.68987 + 2 —6.69192 — 1 1.86542 — 
2.7 —7.45191 + 2 —2.88715 + 2 —1.76387 + 0 —2.83514 — 
2.8 —8.63756 + 2 —3.05673 + 2 —2.39029 + 0 —8.09642 — 
2.9 —9.93633 + 2 —3.19263 + 2 —2.45263 + 0 —6.05693 — 
3.0 —1.13498 + 3 —3.28923 + 2 —1.97549 + 0 4.943837 — 

N=2 n=2 
x c=1 c=2 c=5§ c= 10 
0.1 6.92988 — 2 7.12673 — 2 8.51196 — 2 1.81062 — 
0.2 3.52496 — 1 3.60980 — 1 4.18221 — 1 5.67604 — 
0.3 8.01544 — 1 8.14659 — 1 8.938117 — 1 9.45356 — 
0.4 1.20746 + 0 1.21237 + 0 1.21221 + 0 7.67342 — 
0.5 1.27474 + 0 1.25311 + 0 1.05255 + 0 —3.76902 — 
0.6 7.57995 — 1 7.02242 — 1 2.84685 — 1 —9.13196 — 
0.7 —3.27092 — 1 —3.94312 — 1 —8.22806 — 1 —1.05173 + 
0.8 —1.41364 + 0 —1.43709 + 0 —1.49256 + 0 —7.12440 — 
0.9 —1.01239 + 0 —9.55456 — 1 —4,82515 — 1 1.57998 + 
1.0 3.74003 + 0 3.73507 + 0 3.69332 + 0 2.86776 + 
1.1 1.76108 + 1 1.68770 + 1 1.23963 + 1 2.91426 + 
1.2 4.78738 + 1 4.45904 + 1 2.64247 + 1 1.66265 ++ 
1.3 1.04979 + 2 9.51393 + 1 4.56130 + 1 —6.78936 — 
1.4 2.03280 + 2 1.79119 -+ 2 6.85921 + 1 —1.18788 + 
1.5 3.61809 + 2 3.09537 + 2 9.27854 + 1 —1.12986 + 
1.6 6.05086 + 2 5.01768 +- 2 1.14715 + 2 —2.05881 — 
1.7 9.63954 + 2 7.733857 + 2 1.30588 + 2 7.08003 — 
1.8 1.47641 + 3 1.14868 + 3 1.86797 + 2 8.79453 — 
1.9 2.18846 + 3 1.683438 + 3 1.31147 + 2 3.17527 — 
2.0 3.15489 + 3 2.26396 + 3 1.12984 + 2 —4.11104 — 
2.1 4.44006 + 3 3.05647 + 3 8.37540 + 1 —7.13268 — 
2.2 6.11858 + 3 4.08107 + 3 4.68923 + 1 —3.71699 — 
2.3 8.27598 + 3 5.20576 + 3 7.34290 + 0 2.40288 — 
2.4 1.10092 + 4 6.59547 + 3 —2.92519 + 1 6.17233 — 
2.5 1.44270 + 4 8.21064 + 3 —5.75607 + 1 2.78319 — 
2.6 1.86502 + 4 1.00565 + 4 —7.35509 + 1 —1.46009 — 
2.7 2.38122 + 4 1.21320 + 4 —7.52754 + 1 —2.33031 — 
2.8 3.00582 + 4 1.44287 -+ 4 —6.32751 + 1 —4.05877 — 
2.9 3.75458 + 4 1.69309 + 4 —4.05587 + 1 1.20902 — 
3.0 4.64442 + 4 1.96136 + 4 —1.18748 + 1 4.27288 — 

N=2 n=3 
x c=1 c=2 c=5 c= 10 
0.1 1.26968 — 1 1.29109 — 1 1.44081 — 1 1.94166 — 
0.2 5.91772 — 1 5.99200 — 1 6.48662 — 1 7.75163 — 
0.3 1.13308 + 0 1.13787 + 0 1.15991 + 0 1.06828 + 
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N=2 n=3 

x c=1 c= 2 c= 5 c = 10 

0.4 1.22242 + 0 1.20735 + 0 1.08268 + 0 4.42410 — 1 
0.5 5.02125 — 1 4.61442 — 1 1.728384 — 1 —7.08002 — 1 
0.6 —7.28650 — 1 —7.67417 -— 1 —9.99672 — 1 —1.12281 + 0 
0.7 —1.34994 + 0 —1.33947 + 0 —1.20442 + 0 —6.35200 — 2 
0.8 —1.55815 — 1 —9.78895 — 2 3.13440 — 1 1.38983 + 0 
0.9 1.72855 + 0 1.73119 + 0 1.69507 + 0 6.83573 — 1 
1.0 —4.24197 + 0 —4,23990 + 0 —4.22216 + 0 —4.05639 + 0 
1.1 —4.29825 + 1 —4.14973 + 1 —3.22563 + 1 —1.17483 + 1 
1.2 —1.69966 + 2 —1.604386 + 2 —1.05765 + 2 —1.81766 + 1 
1.3 —4.92189 + 2 —4,54612 + 2 —2.55016 + 2 —1.86221 + 1 
1.4 —1.19761 + 3 —1.08177 + 3 —5.12503 + 2 —1.15388 + 1 
1.5 —2.59483 + 3 —2.28970 + 3 —9.05339 + 2 —3.41475 — 1 
1.6 —5.16494 + 3 —4.44652 + 3 —1.44608 + 3 8.52760 + 0 
1.7 —9.62735 + 3 —8.07479 + 3 —2.12354 + 3 1.01259 + 1 
1.8 —1.70216 + 4 —1.38880 + 4 —2.89670 + 3 4.31901 + 0 
1.9 —2.88067 + 4 —2.28281 + 4 —3.69313 + 3 —3.63918 + 0 
2.0 —4.69805 + 4 —3.61016 + 4 —4.41450 + 3 —7.22580 + 0 
2.1 —7.42193 + 4 —5.52126 + 4 —4.94883 + 3 —5.31249 + 0 
2.2 —1.14040 + 5 —8.19885 + 4 —5.18833 + 3 8.57079 — 1 
2.3 —1.70988 + 5 —1.18598 + 5 —5.05014 + 3 5.14940 + 0 
2.4 —2.50841 + 5 —1.67557 + 5 —4.49592 + 3 5.60742 + 0 
2.5 —3.60853 + 5 —2.31714 + 5 —3.54624 + 3 5.87060 — 1 
2.6 —5.10005 + 5 —3.14232 + 5 —2.28432 + 3 —4.18090 + 0 
2.7 —7.09289 + 5 —4.18531 + 5 —8.50402 + 2 —2.39448 + 0 
2.8 —9.72020 + 5 —5.48229 + 5 5.76599 + 2 —1.90159 + 0 
2.9 —1.31415 + 6 —7.07072 + 5 1.80814 + 3 6.87950 — 1 
3.0 —1.75464 + 6 —8.98762 + 5 2.67521 + 3 4.80391 + 0 


I am indebted to Mrs. E. Sonnenblick for programming and carrying 
out the computations reported here. 


APPENDIX A 


A Perturbation Scheme 


We treat briefly the following problem. Eigenfunctions u, and eigen- 
values \n of an operator L are assumed known. That is, we have 


Lun + AnUn = 0, =O, 8s (114) 


It is desired to find eigenfunctions y, and eigenvalues x, of the perturbed 
equation 


(L — M)y + xv = 0. (115) 
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It is assumed that the wu, satisfy the boundary condition to be imposed 
on the y’s and that the u, are complete in some appropriate sense. We 
proceed further in a purely formal manner. 

Substitute the series 


Yn = Un + 2a €'Q: (116) 
Xn = An + > ea; (117) 
j=l 


into (115). Here in the notation we have suppressed the dependence of 
the Q; and a; on n. By equating to zero the coefficients of distinct powers 
of e, we find 


Lith + Ann = O (118) 
i 
LQ; + AQ; = MQ; — dy anQij- ; (119) 
PS, 2 iis 


where we define Q, = Un. 

Now it frequently happens that the operator M is such that Mu, can 
be expressed as a finite linear combination of the w’s with constant coef- 
ficients. We assume this to be the case and write 


l 

4 

Mu, = ya Untia 
i=—l 


(120) 
= O12) oak 

Here a is a positive integer, / a nonnegative integer, and the superscript 

zon y is not an exponent, but a label. 


If the u, are linearly independent, formal solution of system (119) 
is now straightforward. Set 


jl ; 
a2 Ap Untka 


Q;= 
k=—7l 
A i is 0 (121) 
qed, 2p es 


The A’s of course depend on n, but for simplicity we have suppressed 
this fact in the notation. Again the superscript is a label, not an ex- 
ponent. Substitute (121) and (120) into (119). Setting the coefficient 
of uw, equal to zero in the resultant expression yields 
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l 
Gps Aa Wear, FS VQ cc. (122) 
k=l 
Requiring the coefficient of unime to vanish gives 


i l 
: oe jok jl k 
(Antma — An)Am? = »» aA m — Ame Yn-+(m—k)a (123) 
m= —jl, —jl+1,...,95 3 =1,2,.... 
Here we have adopted the conventions 
A,’ = 0 
if either 
|k| > jl, or ak < —n, or k=O and j = 1,2,... 
A,=1, <A, =0, k¥0,a,=0. 


Equations (122) and (123) together with these conventions permit 
successive determination of the a’s and A’s. The case / = 1 occurs fre- 
quently. The first few coefficients for this case are given below where 
we have set 


hi = [Angi — Mal. 


a= Yn 
A_; = —hw¥n 
Ay’ = —heyn’ 
a, = —[havnYnta + han Yn~a | 


A_g” = heah—cyn “Yn-a 
(ha) Yn [—Y¥n + Yn-a | 

(he) yn [—¥n + Yn4el 

Ag = heala’Yn Ynta 

(ha) ¥n'Ynta (Yn + yng) + (haa) Yn na (— Yn + Yn—a) 
halAr(yn — Ynta) + 2A — Yn420 Ae] 

Ay’ = hoal[A-a' (Yn — Yn—a ) + 2A” — Yn-20 A-s'| 


3 - 3 
a4 = Ay Yn+ea , + A_j Vines 


pp 
ll Il 


g 
i) 
I 


= 
l 


More generally for this case one finds 
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. . i +1 
J J M 
Ay; i (-1) I NgceVns(k—1)a ’ J = 1, Dan's 
j-1 
j j-l 0 ; 
Axgay’ = Axgen 2 hgralai — Yntea |; j = 2,3,... 


Ax»? = Ag» ie (124) 
X [de + hacgyeYnxdina’ Yngla 
+ (ay = Ynsta ) > Naka = Ynska)|; 
oe aoe 
APPENDIX B 


Evaluation of an Integral 
We establish here the formula (43). Let 


Fyn(t) = i Ty(ay)V/ayT wn (y) ay 
(125) 


7 i Ky (ay) Twin (y)dy 


on using the notation of (21). Then 


E a’ + 20 f+ (2! = x) | Pr alo) 


7 det 
= i Trait y Ky (zy) + 2ryKy' (ay) + (27 — x) Ky(ay)]dy (126) 


1 
= | Prnldl(— ay = +N +t 


— x)Ky(zy) + 2ayKw' (xy)ldy 


by (23). Here primes denote differentiation with respect to the argu- 
ment indicated. 
Now 


d page dT y,n(y) (: —N ) = 
iy eae aes ye x Txrn(y) =0 
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with x given by (26). Multiply this equation by Ky(ay) and integrate 
from zero to one. There results 
‘) Pv.» | dy 


1 # =: 2 
o= Kol [2c ~ yf) Me 4 (EN 
Y 
= [tant [SO = v9 Zola) + (EGE + x) Kote | 
where we have integrated by parts and made use of the fact that 


Ky(0) = Ty,,(0) = 0. Carrying out the indicated differentiation, we 
find 











C= I Tynly) | — ya Ky’ (cy) — 2ayKy (ay) 


: (? as iis x) Kyau) dy (127) 


1 
-f Tralyl(—xy — 4 +N? + a? — x)Ky(cy) 





+ QeyKy’ (xy) |dy. 
Equations (126) and (127) give 


Ee j a@ 2+ anf as (0 _ (w + 2n+ S)(w +20 +8) | Peale = 0. 


The only solution of this equation that vanishes for x = 0 is 


J w42n41(X) 


Fyn(2) = k “/% 
Using (125), we then have 


pe = [ Iw (ay) VayTw.n(y) dy. (128) 


To determine k, we have only to compare the coefficients of «”?"4? 


on both sides of (128). In this way we find 


k 
QN+22411 (N + 2n + 2) 


(—1)" [ N+2n4+3 
SS , nr d . 
PIN Gap dal l Yo Pum (way 


(129) 
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The integral here can be evaluated by using (27), (28) and known 
properties of the Jacobi polynomials. We have 


1 
[ yn rT a (y)dy 


ae 
= (" + ” [ yprtentip OV (y oy) dy 


n 
-—l 1 ‘ 
= gon) & ‘3 ¥) [ (1 — u)*(1 — u)* Pa (udu. 


Now the coefficient of wu” in (1 — wu)" is (—1)” and the coefficient of 


wu” in P, (uw) is @ e| af ) i 2” [Ref. 8, Vol. II, p. 169, Eq. (5), 
sO 
n A=) (N ,0) — (N ,0) 
n 


It follows, then, that 


i: QQ —u)*01 — uP” (u) du 


—1 nQn 1 
= (re) a (1 — u)* Pa (u) Pa (u) du 
n 


= (—1) 72"t+s41 
ae (Qn + N +1) 


where we have used the orthogonality of the Jacobi polynomials and 
the known normalization integral [see Ref. 5, page 68, Eq. (4.3.3), for 
example]. Combining these results, (129) yields 


—_ n+ N\"* 
“kn 


and together with (128) this establishes (43). 


REFERENCES 


1. Slepian, D., and Pollak, H. O., Prolate Spheroidal Wave Functions, Fourier 
Analysis and Uncertainty — I, B.S.T.J., 40, Jan., 1961, pp. 43-64. 

2. Landau, H. J., and Pollak, H. O., Prolate Spheroidal Wave Functions, Fouricr 
Analysis and Uncertainty — II, B.S.T.J., 40, January, 1961, pp. 65-84. 


Ro OU AOA N OO om Fe Ww 


—os 


12. 


PROLATE SPHEROIDAL WAVE FUNCTIONS — IV 3057 


. Fox, A. G., and Li, Tingye, Resonant Modes in a Maser Interferometer, 


B.S.T.J., 40, March, 1961, pp. 453-488. 


. Riesz, F., and Sz.-Nagy, B., Functional Analysis, Frederick Ungar Co., New 


York, 1955. 


. Szegd, G., Orthogonal Polynomials, Am. Math. Soc. Colloquium Publication 


23, New York, 1959. 


. Bouwkamp, C. J., On Spheroidal Wave Functions of Order Zero, J. Math. 


Phys., 26, 1957, pp. 79-92. 


. Slepian, D., Some Asymptotic Expansions for Prolate Spheroidal Wave Func- 


tions, to be published. 


. Erdelyi, A., Higher Transcendental Functions, MeGraw-Hill Book Co., New 


York, 1953. 


. Kogelnik, H., Modes in Optical Resonators, Section 5.1, chapter in Advances 


in Lasers, ed. Levine, A. K., Dekker Publishers, New York — to appear. 


. Slepian, D., Analytic Solution of Two Apodization Problems, to be published. 
. Heurtley, J. C., Hyperspheroidal Functions — Optical Resonators with Cir- 


cular Mirrors, to appear in Proceedings of the Polytechnic Inst. of Brooklyn 
Symposium on Quasi-Optics, June 8-10, 1964. 
Born, M., and Wolf, E., Principles of Optics, Pergamon Press, 1959. 


Contributors to This Issue 


R. D. Barnarp, B.E.E., 1952, and M.E.E., 1955, Polytechnic Insti- 
tute of Brooklyn; Ph.D., 1959, Case Institute of Technology; Bell Tele- 
phone Laboratories, 1959-61; faculty, Wayne State University, 1961-62; 
Bell Telephone Laboratories, 1962—. Presently, he is primarily con- 
cerned with theoretical problems in signal theory and control. Member, 
IEEE and American Physical Society, Sigma Xi, Eta Kappa Nu and 
Tau Beta Pi. 


Vactav E. Brengs, A.B., 1950, Harvard College; M.A. and Ph.D. 
1953, Princeton University; Bell Telephone Laboratories, 1953—. Mr. 
Bene§S has been engaged in mathematical research on stochastic processes, 
traffic theory, and servomechanisms. In 1959-60 he was visiting lecturer 
in mathematics at Dartmouth College. He is the author of General Sto- 
chastic Process in the Theory of Queues (Addison-Wesley, 1963). Member, 
American Mathematical Society, Association for Symbolic Logic, Insti- 
tute of Mathematical Statistics, SIAM, Mind Association and Phi Beta 
Kappa. 


JAMES R. Davey, B.S. in E.E., 1936, University of Michigan; Bell 
Telephone Laboratories, 1936—. He has been engaged in the design of 
telegraph and data transmission circuits for the following types of 
systems: de telegraph, multichannel AM and FM earrier telegraph, 
telegraph test and service boards, HF radio teletypewriter and a VHF 
ground-to-air data link. For the past several years he has been in charge 
of a department responsible for the development of various data ter- 
minals for use over telephone voice channels. Member, IEEE, Sigma 
Xi and Tau Beta Pi. 


JAMES P. Gorpon, B.S., Massachusetts Institute of Technology, 1949; 
M.A., 1951, and Ph.D., 1955, Columbia University; Bell Telephone 
Laboratories, 1955—.Visiting Professor of Applied Physics, University 
of California, San Diego, 1962-1963. His research in quantum elec- 
tronics has involved work on paramagnetic resonance, masers, and the 
quantum mechanical aspects of communication theory. He has written 
several technical articles in the field of quantum electronics. Member, 
AAAS, American Physical Society and Sigma Xi. 


3059 


3060 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


J. R. Gray, B.S.E.E., 1954, M.8.E.E., 1955, University of Florida; 
Bell Telephone Laboratories, 1955—. He was first engaged in repeater 
design for a PCM exchange area system, and more recently has been 
concerned with studies of signal impairment in PCM terminals. He is 
presently responsible for a group concerned with systems design of a 
high-speed PCM terminal, working with such problems as framing, 
synchronization, and signal deterioration. 


D.C. Hoaa, B.Sc., 1949, University of Western Ontario; M.Sc., 1950, 
and Ph.D., 1953, McGill University; Bell Telephone Laboratories, 
1953—. His work has included studies of artificial dielectrics for micro- 
waves, diffraction of microwaves, and over-the-horizon and millimeter 
wave propagation. He has been concerned with evaluation of sky noise, 
analysis of performance characteristics of microwave antennas and, most 
recently, propagation of optical waves. Senior member, IEEE; member, 
Commission 2, U.R.S.I., and Sigma Xi. 


T. T. Kapora, B.S., 1958, Yokahama National University (Japan); 
M.S., 1956, and Ph.D., 1960, University of California (Berkeley); Bell 
Telephone Laboratories, 1960—. He has been engaged in the study of 
noise theory with application to optimum detection theory. Member, 
Sigma Xi and SIAM. 


Herwic KoGEunik, Dipl.-Ing., 1955, Dr. techn., 1958, Technische 
Hoschschule Wien, Austria; D. Phil., 1960, Oxford University, England; 
Bell Telephone Laboratories, 1961—. He is engaged in optical maser 
research. Member, American Physical Society, TEEE, Elektrotech- 
nischer Verein Osterreichs (Austria). 


JoHN A. Lewis, B.S., 1944, Worchester Polytechnic Institute; Sc.M., 
1948, Brown University; Ph.D., 1950, Brown University; Bell Telephone 
Laboratories, May, 1951—. A member of the mathematical physics de- 
partment, he has been engaged in theoretical investigation of problems 
of fluid dynamics, piezoelectric vibrations, heat transfer, and satellite 
attitude control. He is currently studying problems of hypersonic flow. 
Member, American Mathematical Society, SIAM, l’Unione Mate- 
matica Italiana and the Society for Natural Philosophy. 


KE. A. J. Marcatini, Aeronautical Engineer, 1947, and E.E., 1948, 
University of Cordoba (Argentina); research staff, University of Cor- 
doba, 1947-1954; Bell Telephone Laboratories, 1954—. He has been en- 


CONTRIBUTORS TO THIS ISSUE 3061 


gaged in theory and design of filters in multimode waveguides and in 
waveguide systems research. More recently he has concentrated in the 
study of optical transmission media. Member, IEEE. 


JosEPH I. Ossanna, JR., B.S.E.E., 1952, Wayne State University; 
Bell Telephone Laboratories, 1952—. He has been concerned with low- 
noise amplifier research, feedback amplifier theory and design, satellite 
position prediction in Project Echo, mobile radio fading studies, and data 
processing. Currently he is involved in the operation of the Murray 
Hill Computation Center. Member, IEEE, Sigma Xi and Tau Beta Pi. 


JoHN W. Pan, B.S. in E.E., 1955, University of Cincinnati; Se.D. in 
E.E., 1962, Massachusetts Institute of Technology; Bell Telephone 
Laboratories, 1955—. He received the Communications Development 
Training Fellowship for study at MIT, 1958-1962. He has been con- 
cerned with the process of pulse code modulation and is currently in 
charge of a group engaged in design of PCM terminals. Member, Sigma 
Xi and IEEE. 


Irwin W. Sanpperc, B.E.E., 1955, M.E.E., 1956, and D.E.E., 1958, 
Polytechnic Institute of Brooklyn; Bell Telephone Laboratories, 1958—. 
He has been concerned with analysis of military systems, particularly 
radar systems, and with synthesis and analysis of active and time-vary- 
ing networks. He is currently involved in a study of the signal-theoretic 
properties of nonlinear systems. Member, [HEE, SIAM, Eta Kappa Nu, 
Sigma Xi and Tau Beta Pi. 


R. A. Sempiax, B.S., 1961, Monmouth College; Bell Telephone 
Laboratories, 1955—. He has been engaged in beyond-the-horizon radio 
propagation and three satellite communications projects: Project Echo, 
Telstar I and Telstar II. He has also participated in studies of the effects 
of rain on sky noise temperatures at 6-gc frequency and has recently 
completed an experimental study of the near-field Cassegrainian antenna. 
He is currently engaged in measuring the scattered radiation from vari- 
ous surfaces at 0.6-micron wavelength. 


W. C. Suauson, A.B. and B.S., 1927, Hamilton College; Bell Tele- 
phone Laboratories, 1927—. His first work consisted of design and de- 
velopment of polarized relays and signals. Later he became concerned 
with U and Y type relays, making the original capability studies and 
establishing the original requirements for U relays. During World War II 


3062 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1964 


he was in charge of specifications and drawings for permanent magnets 
for magnetron tubes, and designed special solenoids for electrically con- 
trolled torpedoes and radar scanning devices. He also developed water- 
tight sealed relays for submarine detection apparatus. During the war 
and afterward he took part in the development of hydrogen annealing 
processes for stabilizing the magnetic characteristics of relays. More re- 
cently he has been responsible for the design and development of cost 
reduction items for relays, one of which is described in this issue. He is a 
Professional Engineer of the State of New York and holds three Bell Sys- 
tem patents. 


DAvID SLEPIAN, University of Michigan, 1941-48; M.A., 1947, and 
Ph.D., 1949, Harvard University; Bell Telephone Laboratories, 1950—. 
He has been engaged in mathematical research in communication theory, 
switching theory, and theory of noise, as well as various aspects of ap- 
plied mathematics. He has been mathematical consultant on a number of 
Bell Laboratories projects. During the academic year 1958-59, he was 
Visiting Mackay Professor of Electrical Engineering at the University 
of California at Berkeley. Member AAAS, American Mathematical 
Society, Institute of Mathematical Statistics, IEEE, SIAM and U.R.S.I. 
Commission 6. 


RicHARD L. Townsenp, B.M.E., 1954, Cornell University; M.S.E.E., 
1959, and EH.E., 1960, Massachusetts Institute of Technology; Bell 
Telephone Laboratories, 1960—. He has been engaged mainly in studies 
of data transmission over telephone facilities and in the analysis of error 
control schemes. Member, IEEE, American Mathematical Society, 
ACM and Sigma X1. 


Rosert N. Warts, B.S.E.E., 1955, University of Miami; 8.M., 1957, 
and E.E., 1960, Massachusetts Institute of Technology; Bell Telephone 
Laboratories, 1960—. His initial assignment was development of meth- 
ods for switching multiaddress data traffic. He also has participated in 
the development of automatic calling units and supervised a group en- 
gaged in the exploratory development and analysis of several error 
control systems. Mr. Watts presently supervises a group engaged in the 
development of DATA-PHONE systems. 


JosEepH H. Wesesr, B.E.E., 1952, Rensselaer Polytechnic Institute; 
M.S.E., 1959, George Washington University; Bell Telephone Labo- 
ratories, 1956—. He has been engaged in traffic analysis of communi- 


CONTRIBUTORS TO THIS ISSUER 3063 


cations systems, with particular emphasis on computer techniques such 
as stochastic simulation. He has also been concerned with the systems 
engineering of electronic switching systems for commercial and military 
applications. He presently supervises a Traffic Studies Center group 
primarily responsible for the analysis of military and civilian com- 
munications networks. 


E. E. Zasac, B.M.E., 1950, Cornell University; M.S.E., 1952, Prince- 
ton University; Ph.D., 1954, Stanford University; Bell Telephone 
Laboratories, 1954—. A specialist in applied mathematics and mechanics, 
he has worked on the dynamics of submarine cable laying and recovery, 
elastic wave propagation in solids, theory of elastic stability, and theory 
of dynamical systems. More recently, he has been concerned with satellite 
attitude control studies and computer-made perspective movies. Mem- 
ber, ASME, American Mathematical Society, Sigma Xi, Phi Kappa 
Phi, Tau Beta Pi and Pi Tau Sigma. 


B.S.T.J. BRIEFS 


A Note on a Signal Recovery Problem 
By I. W. SANDBERG 
(Manuscript received July 17, 1964) 


In a recent study! of the recoverability of square-integrable band- 
limited signals that are distorted by a frequency-selective time-varying 
nonlinear system and subsequently are bandlimited to the original bands, 
certain assumptions were made concerning three of the four functions 
of frequency that characterize the linear time-invariant part of the sys- 
tem. These assumptions, which are stated in Section 3.4 of Ref. 1, are 
satisfied in most, but not all, cases of engineering interest. The purpose 
of this note is to report on an extension of Theorem I of Ref. 1 that covers 
cases in which the conditions of Section 3.4 of Ref. 1 are not met. More 
specifically, a proof of the following result is outlined. 


Theorem: Let Lox and ®(Q) be as defined in Section II of Ref. 1. Let A, 
B, C, D, a, ¥(-,-), and P be as defined in Sections 3.1, 3.2 and 3.3 of 
Ref. 1, and, for any f € Lor, let Y[f] denote the function with values 


¥(f(t),t) (—2 <t< ~), 
With 0* the complement of 2 with respect to (— ©, ), and 


E(w) = D(w) for weQ 
= 1 for we, 
let 
ess sup | [H(C — 1) — PAB]"| < ~, 


—O<lW<00 
and ; 
(1 — a) ess sup | E[E(C — 1) — PABY]"| <1. 


—n<W<Ko 
Let ss be an arbitrary element of @(2). Then @(Q) contains a unique 
element s;, and £2, contains unique elements w, v, and s2 such that 


v= As; + Cw, 8. = Ds; -++- Bu, 


83 = Psp, and v= yw] 
fi.e., such that (1), (2), (3), and (4) of Ref. 1 are satisfied]. Further- 
more, there exists a positive constant k, that depends only on A, B, C, 
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D, and a, such that if 
AS, + Cw, So = Ds, + Bu, 


8; = PS, , and 6 = yfw] 


Dd 


where W, d, 52 € Lor and 3, 3; € B(Q), then 
| si — & |] S kl] ss — 8 |]. 
Outline of Proof: 

Let the mapping of £2 into itself represented in the frequency domain 
by multiplication by E(w) be denoted by E, and let K(Q) denote the 
Banach space of two-vector-valued functions of ¢ belonging to 

Ler x @(Q), 
with norm ||- ||’ defined by 
00 ) 3 
Fil = ({ | fult) Pat + | | fo(t) Pay), f -|*| Lon X B(Q). 


Assume that the hypotheses of the theorem are satisfied. To prove the 
first part of the theorem, it clearly suffices to show that @(Q) contains 
a unique element s,, and Lo, contains a unique element w, such that 


Y[w] = As; + Cu, (1) 
and 
83 = Ds; =F PBw, (2) 


in which P is defined in Section 3.2 of Ref. 1. For this purpose, we may 
replace D in (2) by E and write (1) and (2) as 


I-[a? s1aJ-[] 


in which P[w] = Y[w] — w, ¢ is an arbitrary positive constant, and I is 
the identity operator. 
The operator 


ia eo r = 


is a bounded mapping of K(Q) into itself. In view of the first inequality 
of the theorem, it possesses an inverse on K(Q2), and L™ can be repre- 
sented in the frequency domain by the matrix-valued function 


we 1 E —A 
BNO E(¢ —1) — Bia | = PE ce — sh 
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In particular, (3) can be written as 


|, | ~ in|. | + Le (5) 


in which N is the operator defined on K(Q) by 
_| vif 2 
The second inequality of the theorem implies that there exists a posi- 
tive number { such that L™'N is a contraction mapping of «(Q) into 


itself for all ¢ > >. In fact, using Parseval’s identity and the frequency 
domain representation of L™', we find that for all f, g ¢ K(Q), 
|| LUNS — L“Ng ||’ S max (c1, 2) || Nf — Ng |!’ 


= (1 — @) max (41, ¢2) || f — g ||’ 


in which 


Il 


Cy 


ess sup | E[E(C — 1) — PABJ" |, 


and 


I 


co = ¢ ess sup | PB[E(C — 1) — PAB] |. 
In view of the contraction-mapping fixed-point theorem, this establishes 
the existence and uniqueness of the functions w and s; (as well as the 
important fact that these functions can be determined by an iteration 
procedure that converges at a geometric rate). 

The second part of the theorem follows directly from (5), the relation 


al ee _f 0 
[fe] +e te 


and the fact that L™'N is a contraction for ¢ > {. 
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Detection of Weakly Modulated Light 


at Microwave Frequencies 


By M. G. COHEN and E. I. GORDON 
(Manuscript received August 10, 1964) 


Studies of the photoelastic, electro-optic or magnetic-optic properties 
of materials at high frequencies often require the detection of microwave 
modulated light. In many cases, the modulation depth is sufficiently 
small that quantitative measurement becomes difficult if not impossible. 
The purpose of this brief is to describe a homodyne-superheterodyne 
technique which allows measurement of modulation depths of consider- 
ably less than 10-°. 

At such small modulation depths, the light-associated shot noise can 
be large compared to the modulation signal. Under these circumstances, 
it is customary to use synchronous detection techniques following a 
sensitive superheterodyne receiver and to chop the modulation signal at 
some low frequency. This requires extremely good RF shielding between 
the modulation source and the receiver to avoid pickup. The variations 
in amplitude and phase of the pickup produce an unsteady output signal. 
Alternately, one can chop both the light and the modulation and per- 
form the synchronous detection at a sum or difference frequency. In 
any case, the limiting sensitivity is determined by noise originating in the 
photodetector and the receiver. 

The technique described here is considerably simpler and more sensi- 
tive. Fig. 1 indicates the usual synchronous detection scheme using a 
photodetector feeding a microwave receiver with a 30-mes IF strip. A 
reference signal for the synchronous detector is derived from the light 
chopping wheel. The added feature is the injection into the line incident 
on the receiver of a small fraction of the CW modulating signal, taken 
from the input line to the interaction region or modulator and passed 
through a variable attenuator and phase shifter. The amplitude of the 
injected signal is kept at least 30 db larger than that of the pickup. Thus 
the amplitude and phase of the total injected signal, including pickup, 
are essentially independent of fluctuations in the phase and amplitude 
of the pickup. The mixer and IF strip are operated in their linear regions. 
Thus the signal output v, from the final stage of the IF amplifier, which 
is an envelope detector, can be written 


Vo = | o:(1 + T) + bn + 9, + di: | eintecdetiaee-s (1) 
Here »; is the injected signal and T <<< 1 represents that part of the 
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Fig. 1 — Block diagram of the homodyne-superheterodyne detection scheme. 


injected signal which, because of imperfect isolation, is incident on the 
photodetector and is reflected with a component at the chopping fre- 
quency (because the photodetector RF impedance is dependent on the 
light intensity as, for example, in a photodiode) 6,, is the modulation 
signal from the light, 0, is the light-associated shot noise from the 
photodetector and v, is the receiver equivalent input noise. The tilde 
over some of the quantities indicates that they are chopped at the refer- 
ence frequency of the synchronous detector. By far the largest signal is 
v;, SO ¥. can be written to a very good approximation 


v = |v | + 2 Rel + 2(|6,|/| |) cos 6 + terms of order 
| 5. |’/| vi |, | o, P/| v¢ ? and higher] 

lv;| (1 + 2 Ref) + 2|5,, | cos 6 + terms of order 
| 8 P/| 0s |, |e P/| | ete. 


in which @ is the phase angle between 6, and v;, and Re indicates the 
real part. All other cross terms have a time average of zero; the only 
signals which are coherently related are v; and 0,, . Thus all noise terms 
can be made arbitrarily small compared to |, | by making | v; | large. 
The contribution from the chopped term containing I can be made 
arbitrarily small by using more isolation in the line immediately follow- 


(2) 
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ing the photodetector. In the case of a photomultiplier, this is not nor- 
mally necessary. Even when these terms are not completely negligible 
compared to d,, cos 6, their effect can be eliminated by varying the phase 
of the injected signal so that cos @ takes on the value +1. The only out- 
put which depends on @ is the desired modulation signal. Thus one need 
only take the algebraic difference between the extreme deflections of the 
synchronous detector as @ is varied. The fact that there is a synchronous 
detector deflection which depends on the phase of the injected signal is 
an unambiguous indication of microwave modulation on the light. 

All aspects of (2) have been verified in the course of photoelastic and 
electro-optic modulation experiments above 150 mc by placing variable 
attenuators in various parts of the circuit to see if the variation of each 
term had the proper dependence. Modulation depths of 10-* could be 
easily and accurately determined with integration times following the 
synchronous detector of less than one second. No special shielding was 
required. It should also be noted that the output of the synchronous 
detector is proportional to the RF amplitude rather than the square of 
the amplitude as in most other radiometer detection schemes. Thus, the 
output is proportional to the amplitude of the light modulation rather 
than its square. 


An Improved Error Bound 
for Gaussian Channels 

By A. D. WYNER 

(Manuscript received August 18, 1964) 


I. INTRODUCTION 


The problem considered here is that of coding for the time-discrete 
amplitude-continuous memoryless channel with additive Gaussian noise, 
the code words lying on the surface of an n-dimensional hypersphere 
with center at the origin and radius ~/nP. 

We define a code as a set of M real n-vectors @ = (a1, %2,°°* , Xn) 
satisfying the (‘‘energy”’) constraint, . 


ate = nP. (1) 


The transmission rate R is defined by M = e"", so that R = (1/n) nM. 
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The code words are transmitted through a channel in which they are 
corrupted by noise, the received word 7 = (y1, y2,-°*+ , Yn) being the 
vector sum of the transmitted word and a noise vector 2, i.e., 


y= (Yrs Yay cee ay = (a1 + 21, U2 + 22, °° 5 an ten) = E+ 2. (2) 


The components of the noise vector z.(k = 1, 2, --- , n) are assumed 
to be statistically independent Gaussian random variables with mean 
zero and variance N. 


The signal “energy” is >> x = nP, and the expected noise 
k=1 


“energy” is E[>-x 2] = nN, so that the signal-to-noise energy ratio is 
P/N. This quantity is also the signal-to-noise ‘average power.” 

It is the task of the decoder to examine the received vector % and de- 
cide which code word # was actually transmitted. If P.; is the probability 
that the decoder makes an incorrect choice when code word 7 is trans- 
mitted (7 = 1, 2,3, --- , 7), and if each of the M code words is equally 
likely to be transmitted, then the over-all probability of a decoding error 
is 


1 M 
P e— Mu aE a eie (3) 
It is not hard to show that the decoding scheme which minimizes P, for 
a given code is the minimum-distance decoder, where the decoder selects 
that code word which has smallest Euclidean distance from the received 
vector and announces that word as the one which was transmitted. 
Thus if 9 = (y1,Y2,°°* Yn) is the received vector, the decoder an- 
nounces that code word % which minimizes (with respect to £) 


(tr — yx)” = Dom! + Dye — 2 Do te. 


Me 


az) = 


ll 
» 


k 


Since Dy: a; = nP, d(#%) is minimized when yor tye is maximized. 
Hence minimum-distance decoding is equivalent to selection of that 
code word which minimizes the angle in n space a(#,g) between £ and 
g, where 


pa LYk 
Cae wy 


The behavior of codes for this channel has been investigated in detail 
by Shannon,’” who has shown the following: 


(4) 


cos a(%,7) = 
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Fundamental Coding Theorem: Let R be any number such that 
R<C =3ln [1 + (P/N). 


For each n, there exists an n-dimensional code with rate R(M = e"”) such 
that the error probability 1s 


P. Zs gee, (5) 


where the exponent E(R) (called the “‘reliability”’) 1s positive when R < C 
(so that P, “> 0). 
Shannon? also obtained estimates of the best possible exponent 


E(R) = lim — (1/n) In P. . 


In this note we establish the following upper bound on E(R) (i.e., a 
lower bound on P,): 


P or 
pai 
E(R) Sy ve (6) 
For small rates R, (6) is sharper than the bounds of Ref. 2. Inequality 
(6) is plotted together with the estimates on H(R) in Ref. 2 in Fig. 1. 





Fig. 1 — New upper bound on E(R) vs & for P/N = 4 (solid line). The bounds 
on E(£) of Ref. 2 are in dotted lines. E(R) lies in the shaded area. 
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II. DERIVATION OF THE BOUND 


Consider an n-dimensional code with M code words #,, #2, --- , tu. 
Let @ be the minimum angle between pairs of code words a(#; , %;) 
(t #7). Denote by 6,(J7) the largest possible minimum angle @ in an n- 
dimensional code with M/ code words, and by 


sn(M) = 2+/nP sin [6, (M1) /2], 
the largest possible minimum distance between pairs of code words in an 
n-dimensional code with M code words. Paralleling an argument of 
Shannon [Ref. 2, pp. 647-648] it is not hard to show that the error prob- 
ability satisfies 


P, = 3 (- ne sin ae), (7) 


where 


] xz 
&(2) — val oe ay 


is the cumulative error function. 
We now employ the following result of Rankin’ to obtain an upper 
bound on 6,(A/): 





rT (° 7 ) sin 8 tan 6 
Ms ——?-Y (8) 
Ab (;) | (sin gy)” *(cos g — cos B)de 

2) Jo 


where 8 = sin ~/2 sin (6/2), and @ is the minimum angle in an n-di- 
mensional code with M/ code words. Taking logarithms of (8) yields 





n—-l 

| ee 1 a +) 

In M S —- ln ~ sin B tan 6 + - In —*——— 
n 2 n 


1 
© 


8 
_ * in | (sin g)” "(cos g — cos B)de. 
0 


R= 


It is shown in the appendix that for large n we may approximate the 
upper bound of (9) by —In 1/2 sin (0/2), yielding 


1 
sin 5 < Va’ (10) 
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Since for large n, a code with M/2 points has the same rate as one 
with M points (10) and (7) yield (for large 7) 


P, > 1@ (- uF) (11) 
e= 2 N /2 : 

Using the well known asymptotic form of the cumulative error func- 

tion ®(—2) ~ (1/xv/2r)e"” (large x) we obtain 


E(R) = lim — 1 InP, < = ga (12) 
n> n N 


APPENDIX 


We must show that the limit of the right-hand member of inequality 
(9) as n tends to infinity is — In +/2 sin (6/2). The first two terms of 
this quantity both tend to zero as n becomes large, so that we must 
show the following: 


Let 
8 
= / sin” ’ » (cos g — cos B)dy, 
0 
then 
ee , 
E= lim ~ In In = In sin Bg. 
Proof: 


8 
(a) In S i sin” ’ 8 (cose — cos 8)dy = sin” 8 [sin B — B cos 6], 
0 





so that 
ae a < jaa Za Re + 1 In [sin 8 — 6 cos B] — In sin 8. 
n n n 
8 
(b) In & [ sin” * » (cosy — cos 8)de 
B~(8/n) 


(18) 


8 
[ (cos ¢ — cos B)dg. 
B—(B/n) 


IV 


sin” ” (6 — ®) 
n 


Now 
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8 
r= f (cos ¢ — cos B)dp = sing ~ sin (¢ — 2) — 8 cos p 


— (B/n) 


B 


‘ ‘ B ‘ B 
= — es eee, B. 
sin B sin 6 cos— + cos @ sin COS | 


Expanding sin (8/n) and cos (8/n) into power series in (8/n), we ob- 


tain 
ES sin a| & -+- o(4)] — B sin afl + o(1)] 
2n? n? 2n? . 
Thus 
1 Oe ere ae 1 n 
~InI =~ Ingjsin8 +> In (1 + o(1)) + 0. 


From (13) we have 


n— 2 





ee ae 
n 


In sin (8 - 2) +4ins*msing. 
n n 
Therefore Z = In sin 8, which completes the proof. 
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